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ENDO BETA-1.4«LUCANASE FROM ASPERGILLUS 



The present invention relates to an enzyme. In addition, the present invention relates to 
a nucleotide sequence coding for the enzyme. Also, the present invention relates to a 
5 promoter, wherein the promoter can be used to control the expression of the nucleotide 
sequence coding for the enzyme. 

In panicular, the enzyme of the present invention is a glucanase enzyme - i.e. an enzyme 
that can degrade j8-l,4-glucosidic bonds. 

10 

It is known that it is desirable to direct expression of a gene of interest ( "GOI") in certain 
tissues of an organism - such as a filamentous fungus (such as Aspergillus niger) or-even 
a plant crop. The resultant protein or enzyme may be useful for the organism itself. For 
example, it may be desirable to produce crop protein products with an optimised amino 

15 acid composition and so increase the nutritive value of a crop, For example, the crop 
may be made more useful as a feed. In the alternative, it may be desirable to isolate the 
resultant protein or enzyme and then use the protein or enzyme to prepare, for example, 
food compositions. In this regard, the resultant protein or enzyme can be a component 
of the food composition or it can be used to prepare food compositions, including altering 

20 the characteristics or appearance of food compositions. 

It may even be desirable to use the organism, such as a filamentous hmgus or a crop 
plant, to express non-plant genes, such as for the same purposes. 

25 Also, it may be desirable to use an organism, such as a filamentous fungus or a crop 
plant, to express manmialian genes. Examples of the latter products include interferons, 
insulin, blood factors and plasminogen activators. 

It is also desirable to use micro-organisms, such as filamentous fungi, to prepare products 
30 from GOIs by use of promoters that are active in the micro-organisms 
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Fruit and vegetable cell walls largely consist of polysaccharide, the major cotnponents 
being pectin, cellulose and xyloglucan, R.R. Selvendran and J. A, Robenson, IFR Report 
1989. Numerous cell wall models have been proposed which attempt to incorporate the 
essential propenies of strength and flexibility (P. Albersheim, Sci. Am. 232, 81-95, 
1975;. P. Albersheim, Plant Biochem. 3rd Edition (Bonner and Vamer), Ac. Press, 
1976; T. Hayashi, Ann. Rev. Plant Physiol. & Plant Mol. Biol., 40, 139-168, 1989). 

The composition of the plant cell wall is complex and variable. Polysaccharides are 
mainly found in the form of long chains of cellulose (the main structural component of 
the plant cell wall), hemicellulose (comprising various B-xylan chains, such as 
xyloglucans) and pectic substances (consisting of galacturonans and rhanmogalacturonans; 
arabinans; and galactans and arabinogalactans). 

In particular, giucans are polysaccharides made up exclusively of glucose subunits. 
Typical examples of giucans are starch and cellulose. 

The enzymes that degrade giucans are collectively referred to as glucanases. A typical 
glucanase is i3-l,4*endoglucanase. 

j3-l,4-endoglucanases have uses in many industries. For example, in the brewing 
industry, barley is used for production of malt, and, in the latter years, as adjunct in the 
brewing process. When the quality of the malt is poor, or barley has been used as an 
adjunct, problems with high viscosity in the wort can arise because of j3-glucans from the 
barley. In this regard, barley contains large quantities of mixed 13-1.3/1,4- giucans of 
very high molecular weight. When dissolved, these giucans produce high viscosity 
solutions, which can cause troubles in some applications. For example, the high viscosity- 
reduces the filterability of the wort and can lead to unaccepuble long filtration times. 
To avoid these problems i3-elucanase has been traditionally added to won to avoid such 
problems - i e the problem with giucans can be avoided by addition of enzymes, m 
particular, glucanases, which degrade the polymers. 



wo 96/29415 PCT/EP96/01008 

3 

Further infonnaiion on these problems may be found in the Grindsted brochure called 
"Glucanase GV\ the reviews by Dr. C.W. Bamforth (Brewers Digest June 1982 pages 
22-28; and Brewers' Guardian September 1985 pages 21-26), and the paper by T.Godfrey 
(Industrial Enzymology The Application of Enzymes in Industry Chapter 4.5 pages 221- 
5 259). 

In the feed industry barley can be used for chicken feed because it is cheap, but again the 
/S-glucan can give problems for the digestion of the chicken. By addition of /3-glucanase 
to the feed the digestibility of the feed can be increased. In addition, the faeces of 
10 chickens feeding on feed containing barley is sticky making it difficult to remove and 
results in dirty eggs. 

WO 93/2019 discusses endo-6-l,4-glucanases (EC no. 3.2.1.4). According to WO 
93/2019, these glucanases are a group of hydrolases which catalyse endo hydrolysis of 
15 1 ,4-fl-D-glycosidic linkages in cellulose, lichenin, cereal B-D-glucans and other plant 
material containing cellulosic parts. Endo-I,4-B-D-glucan 4-gIucano hydrolase is 
sometimes called endo-0-l,4-glucanase. 

The endo-6-.l,4-glucanase of WO 93/2019 exhibits a pH-opiimum of 2.0 to 4.0, an 
20 isoelectric point of 2.0 to 3.5, a molecular weight of between 30,000 and 50.000, and 
a temperature optimum between 30 and 70° C. 

Further teachings on glucans may be found in WO 93/17101, in particular xyloglucans. 

According to WO 93/17101 the xyloglucans are 1,4-6-glucans that have been extensively 
25 substituted with a-L6-xylosyl side chains, some of which are 1 ,2-B-galactosylated. They 

are found in large amounts in the primary cell walls of dicots but also in certain seeds. 

where they serve different roles. Primary cell wall xyloglucan is fucosylaied. 

Xyloglucan is tightly hydrogen bonded to cellulose microfibrils and requires concentraied 

alkali or strong swelling agents to release it. Xyloglucan is thought to form cross-bridges 
30 between cellulose microfibrils, the cellulose/ xyloglucan network forming the major load- 

bearing/elastic network of the wall. DCB mutated suspension culture cells (cell walls 

lacking cellulose) release xyloglucan into their media, suggesting that xyloglucan is 
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normally rightly bound to cellulose. 
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Hydrolysis of primary cell wall xyloglucan has been demonstrated in segments of dark 
grown squash hypocotyls, during lAA induced growth (K. Wakabayashi et al. Plant 
5 Physiol., 95, 1070-1076, 1991). Endohydrolysis of wall xyloglucan is thought to 
contribute to wall loosening which accompanies cell expansion (T. Hyashi, Ann. Rev. 
Plant Physiol. & Plant Mol. Biol., 40, 139-168, 1989). The average molecular weight 
of xyloglucan has also been shown to decrease during tomato fruit ripening and this may 
contribute to the tissue softening which accompanies the ripening process (D.J. Huber, 
10 J. Amer. Soc. Hort. Sci., 108(3), 405-409, 1983). Certain seeds, e g. Nastunium, 
contain up to 30% by weight of xyloglucan, stored in thickened cotyledonary cell walls, 
which serves as a reserve polysaccharide and is rapidly depolymerised durmg 
germination. 

15 It would be useful to increase giucanase activity, for example to have a plant with high 
concentration of giucanase for use in feed, preferably using recombinant DNA 
techniques. 

The present invention seeks to provide an enzyme having giucanase activity; preferably 
20 wherein the enzyme can be prepared in certain or specific cells or tissues, such as in just 
a specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
genus Aspergillus, such as Aspergillus niger, or even a plant. 

Also, the present invention seeks to provide a GOI coding for the enzyme that can be 
25 expressed preferably in specific cells or tissues, such as in certain or specific cells or 
tissues, of an organism, typically a filamentous fungus, preferably of the genus 
Aspergillus, such as Aspergillus niger, or even a plant. 

In addition, the present invention seeks to provide a promoter that is capable of direcimg 
30 expression of a GOI, such as a nucleotide sequence coding for the enzyme according to 
the present invention, preferably in certain specific cells or tissues, such as in just a 
specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
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genus Aspergillus, such as Aspergillus niger, or even a plant. Preferably, the promoter 
is used in Aspergillus wherein the product encoded by the GOI is excreted from the host 
organism into the surrounding medium. 

5 Furthermore, the present invention seeks to provide constructs, vectors, piasmids, cells, 
tissues, organs and organisms comprising the GOI and/or the promoter, and methods of 
expressing the same, preferably in specific cells or tissues, such as expression in just a 
specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
genus Aspergillus, or even a plant. 

10 

According to a first aspect of the present invention there is provided an enzyme 
obtainable from Aspergillus, wherein the enzyme has the following characteristics: a-NfW 
of 24,235 D ± 50 D; a pi value of about 4; glucanase activity; and wherein the 
glucanase activity is endo i3-l,4-glucanase activity. 

15 

According to a second aspect of the present invention there is provided an enzyme having 
the sequence shown as SEQ. LD. No. I or a variant, homologue or fragment thereof 

According to a third aspect of the present invention there is provided an enzyme coded 
20 by the nucleotide sequence shown as SEQ. I. D. No. 2 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

According to a fourth aspect of the present invention there is provided a nucleotide 
sequence coding for the enzyme according to the present invention. 

25 

According to a fifth aspect of the present invention there is provided a nucleotide 
sequence having the sequence shown as SEQ. I.D. No. 2 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

30 According to a sixth aspect of the present mvemion there is provided a promoter having 
the sequence shown as SEQ. I.D. No. 3 or a variant, homologue or fragment thereof or 
a sequence complemenury thereto. 
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According to a seventh aspect of the present invention there is provided a lerminator 
having the nucleotide sequence shown as SEQ. I.D. No. 13 or a variant, homologue or 
fragment thereof or a sequence complementary thereto, 

S According to an eighth aspect of the present invention there is provided a signal sequence 
having the nucleotide sequence shown as SEQ. I D. No. 14 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

According to a ninth aspect of the present invention there is provided a process for 
10 expressing a GOI by use of a promoter, wherein the promoter is the promoter according 
to the present invention. 

According to a tenth aspect of the present invention there is provided the use of an 
enzyme according to the present invention to degrade a glucan. 

15 

According to an eleventh aspect of the present invention there is provided plasmid 
NCIMB 40704. or a nucleotide sequence obtainable therefrom for expressing an enzyme 
capable of degrading arabinoxylan or for controlling the expression thereof or for 
controlling the expression of another GOI, 

20 

According to a twelfth aspect of the present invention there is provided a signal sequence 
having the sequence shown as SEQ. I.D. No. 15 or a variant, homologue or fragment 
thereof. 

25 According to a thirteenth aspect of the present invention there is provided a glucanase 
enzyme having the ability to degrade /3-1.4-glucosidic bonds, which is immunologically 
reactive with an antibody raised against a purified glucanase enzyme havmg the sequence 
shown as SEQ I D. No. 1. 

30 According to a founecnth aspect of the present invention there is provided a promoter 
that is inducible by glucose. 
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According to a fifteenth aspect of the present invention there is provided the use of 
glucose to induce a promoter. 

Other aspects of the present invention include constructs, vectors, plasmids, cells, tissues, 
5 organs and transgenic organisms comprising the aforementioned aspects of the present 
invention. 

Other aspects of the present invention include methods of expressing or allowing 
expression or transforming any one of the nucleotide sequence, the construct, the 
10 plasmid, the vector, the celK the tissue, the organ or the organism, as well as the 
products thereof. 

Additional aspects of the present invention include uses of the promoter for expressing 
GOIs in culture media such as a broth or in a transgenic organism. 

15 

Further aspects of the present invention include uses of the enzyme for preparing or 
treating foodstuffs, including animal feed. 

In the following text, the enzyme of the present invention is sometimes referred to as 
20 Egla enzyme and the coding sequence therefor is sometimes referred to as the Egla gene. 
In addition, the promoter of the present invention is sometimes refeired to as Egla 
promoter. 

Preferably the enzyme is coded by the nucleotide sequence shown as SEQ. I.D. No 2 
25 or a variant, homologue or fragment thereof or a sequence complementary thereto. 

Preferably the nucleotide sequence has the sequence shown as SEQ. I.D. No. 2 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 



30 



Preferably the nucleotide sequence is opcrativcly linked to a promoter. 
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Preferably the promoter is the promoter having the sequence shown as SEQ. I.D. No. 
3 or a variant, homologue or fragment thereof or a sequence complementary thereto. 

Preferably the promoter of the present invention is operatively linked to a GOI. 

5 

Preferably the GOI comprises a nucleotide sequence according to the present invention. 

In one preferred embodiment, the transgenic organism is a fungus. For example the 
organism can be a yeast, which would then be useful in for example the brewing 
10 industry. 

Preferably the transgenic organism is a filamentous fungus* more preferably of the genus 
Aspergillus. 

IS In another preferred embodiment the transgenic organism is a plant. 

In another preferred embodiment the transgenic organism is a yeast. In this regard, yeast 
have been widely used as a vehicle for heterologous gene expression. The species 
Saccharomyces cerevisiae has a long history of industrial use, including use for 
20 heterologous gene expression. Expression of heterologous genes in Saccharomyces 
cerevisiae has been reviewed by Goodey et al (1987, Yeast Biotechnology, D R Berry 
et al, eds, pp 401-429, Allen and Unwin, London) and by King et al (1989, Molecular 
and Cell Biology of Yeasts, E F Walton and G T Yarronton, eds, pp 107-133, Blackie. 
Glasgow), 

25 

For several reasons Saccharomyces cerevisiae is well suited for heterologous gene 
expression. First, it is non-pathogenic to humans and mcapable of producmg ccnain 
endotoxins. Second, it has a long history of safe use following cenmries of commercial 
exploiuiion for various purposes. This has led to wide public acceptability Third, the 
30 extensive commercial use and research devoted co the organism has resulted in a wealth 
of knowledge about the genetics and physiology as well as large-scale fermentation 
characteristics of Saccharomyces cerevisiae. 
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An additional advantage is that yeasts are capable of post*translational modifications of 
proteins and thereby have the potential for glycosylaiion and/or secretion of heterologous 
gene products into the growth medium. A review of the principles of heterologous gene 
expression in Saccharomyces cerevisiae and secretion of gene products is given by E 
5 Hinchcliffe E Kenny (1993, "Yeast as a vehicle for the expression of heterologous 
genes ", Yeasts, Vol 5. Anthony H Rose and J Stuart Harrison, eds, 2nd edition, 
Academic Press Ltd.). 

The glycosylation of enzymes expressed in yeast is known to increase heat stability of the 
10 enzyme. Enhancing the heat stability of the glucanase according to the present invention 
will make this enzyme suitable for use in the brewing industry and for use in the 
preparation of animal feed, i.e. chicken feed. 

Yeasts are known to secrete very few proteins into the culture medium. This makes yeast 
15 a very attractive host for expression of heterologous genes, since secretable gene products 
can easily be recovered and purified. 

Several types of yeast vectors are available, including integrative vectors, which require 
recombination with the host genome for their maintenance, and autonomously replicating 
20 plasmid vectors. 

In order to prepare the transgenic Saccharomyces, expression constructs are prepared by 
inserting a GOI (such as an amylase or SEQ. ID No 2) into a construct designed for 
expression in yeast. Several types of constructs used for heterologous expression have 
25 been developed. The constructs contain a promoter active in yeast fused to the GOI, 
usually a promoter of yeast origin, such as the GALl promoter, is used, The GOI can 
be fiiscd to a signal sequence which directs the protein encoded by the GOI to be 
secreted. Usually a signal sequence of yeast origin, such as the sequence encoding the 
SUC2 sigiuil peptide, is used. A terminator active in yeast ends the expression system. 

30 

Heterologous expression in yeasi has been reported for several genes. The gene products 
can be glycosylated which is advantageous for some enzymes intended for specific 
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application where heat tolerance is desirable. The proteins can be deposited 
intracellularly if the GOI is not fused to a signal sequence, or they can be secreted 
exiracelluarly if the GOI is equipped with a signal sequence. 

5 For the u-ansformation of yeast several transformation protocols have been developed. 

For example, the transgenic Saccharomyces according to the present invention can be 
prepared by following the teachings of Hinnen et al (1978, Proceedings of the National 
Academy of Sciences of the USA 75, 1929) Beggs, J D (1978, Nature, London, 275, 
10 104); and Ito, H et al (1983, J Bacteriology 153, 163-168). 

The transformed yeast cells are selected using various selective markers. Among the 
markers used for transformation are a number of auxotrophic markers such as LEU2, 
HIS4 and TRPl. and dominant antibiotic resistance markers such as aminoglycoside 
15 antibiotic markers, eg G418. 

Highly preferred embodiments of each of the aspects of the present invention do not 
include any one of the native enzyme, the native promoter or the native nucleotide 
sequence in its natural environment. 

20 

Preferably, in any one of the plasmid, the vector such as an expression vector or a 
transformation vector, the cell, the tissue, the organ, the organism or the transgenic 
organism, the promoter is present in combination with at least one GOI. 

25 Preferably the promoter and the GOI are stably incorporated within the transgenic 
organism*s genome. 

Preferably the transgenic organism is a filamentous fungus, preferably of the genus 
Aspergillus, more preferably Aspergillus niger. Alternatively, the transgenic organism 
30 can be a yeast. The transgenic organism can even be a plant, such as a monocot or dicot 
plant. 
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A highly preferred embodiment is an enzyme obtainable from Aspergillus, wherein the 
enzyme has the following characteristics: a NfW of 24,235 D ± 50 D; a pi value of 
about 4; glucanase activity; and wherein the glucanase activity is endo /3-1,4-glucanase 
activity; wherein the enzyme has the sequence shown as SEQ. I.D. No. 1 or a variant, 
5 homologue or fragment thereof. 

Another highly preferred embodiment is an enzyme obtainable from Aspergillus, wherein 
the enzyme has the following characteristics: a MW of 24,235 D ± 50 D; a pi value of 
about 4; glucanase activity; and wherein the glucanase activity is endo ^-1,4-glucanase 
10 activity; wherein the enzyme is coded by the nucleotide sequence shown as SEQ. I.D. 
No. 2 or a variant, homologue or fragment thereof or a sequence complementary thereto. 

The advantages of the present invention are that it provides a means for preparing a 
glucanase enzyme and the nucleotide sequence coding for the same. In addition, it 
15 provides a promoter that can control the expression of that, or another, nucleotide 
sequence. 

Other advantages of the present invention are that the enzyme can be used to prepare 
useful feeds containing cereals, such as barley, maize, rice etc. 

20 

The present invention therefore provides an enzyme having glucanase activity wherem 
the enzyme can be prepared in certain or specific cells or tissues, such as in just a 
specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
genus Aspergillus, such as Aspergillus niger. The enzyme may even be prepared in a 
25 plant. 

Also, the present invention provides a GOI coding for the enzyme that can be expressed 
preferably in specific cells or tissues, such as in certain or specific cells or tissues, of an 
organism, typically a filamentous fungus, preferably of the genus Aspergillus, such as 
30 Aspergillus niger. The GOI may even be expressed in a plant. 
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In addition, the present invention provides a promoter that is capable of directing 
expression of a GOI, such as a nucleotide sequence coding for the enzyme according to 
the present invention, preferably in certain specific cells or tissues, such as in just a 
specific cell or tissue, of an organism, typically a filamentous fungus, preferably of the 
5 genus Aspergillus, such as Aspergillus niger, or even a plant. Preferably, the promoter 
is used in Aspergillus wherein the product encoded by the GOI is excreted from the host 
organism into the surrounding medium. The promoter may even be tailored (if 
necessary) to express a GOI in a plant. 

10 The present invention also provides constructs, vectors, plasmids, cells, tissues, organs 
and organisms comprising the GOI and/or the promoter, and methods of expressing the 
same, preferably in specific cells or tissues, such as expression in just a specific cell or 
tissue, of an organism, typically a filamentous fungus, preferably of the genus 
Aspergillus, or even a plant. 

15 

The terms "variant", "homologue" or "fragment" in relation to the enzyme include any 
substitution of, variation of, modification of, replacement of, deletion of or addition of 
one (or more) amino acid from or to the sequence providing the resultant amino acid 
sequence has glucanase activity, preferably having at least the same activity of the 

20 enzyme shown in the sequence listings (SEQ I.D. No. 1 or 12). In particular, the term 
"homologue" covers homology with respect to structure and/or function providing the 
resultant enzyme has glucanase activity. With respect to sequence homology, preferably 
there is at least 75%, more preferably at least 85%, more preferably at least 90% 
homology to SEQ ID NO. 1 shown in the attached sequence listings. More preferably 

25 there is at least 95%, more preferably at least 98%, homology io SEQ ID NO. I shown 
in the attached sequence listings. 

The terms "variant", "homologue" or "fragment" in relation to the nucleotide sequence 
coding for the enzyme include any substitution of. variation of, modification of. 
30 replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
sequence providing the resultant nucleotide sequence codes for an enzyme having 
glucanase activity, preferably having ai least the same activity of the enzyme shown in 
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the sequence listings (SEQ I.D. No. 2 or 12). In particular, the term '^homologue" 
covers homology with respect to structure and/or function providing the resultant 
nucleotide sequence codes for an enzyme having glucanase activity. With respect to 
sequence homology, preferably there is at least 75%, more preferably at least 85%, more 
5 preferably at least 90% homology to SEQ ID NO. 2 shown in the attached sequence 
listings. More preferably there is at least 95%, more preferably at least 98%, homology 
to SEQ ID NO. 2 shown in the attached sequence listings. 

The terms "variant", "homologue" or "fragment" in relation to the promoter include any 
10 substitution of, variation of, modification of, replacement of, deletion of or addition of 
one (or more) nucleic acid from or to the sequence providing the resultant nucleotide 
sequence has the ability to act as a promoter in an expression system - such as the 
transformed cell or the transgenic organism according to the present invention. In 
particular, the term "homologue" covers homology with respect to structure and/or 
15 function providing the resultant nucleotide sequence has the ability to act as a promoter. 
With respect to sequence homology, preferably there is at least 75%, more preferably at 
least 85%, more preferably at least 90% homology to SEQ ID NO. 3 shown in the 
attached sequence listings. More preferably there is at least 95%, more preferably at 
least 98%, homology to SEQ ID NO. 3 shown in the attached sequence listings. 

20 

The terms "variant", "homologue" or "fragment" in relation to the terminator or signal 
nucleotide sequences include any substitution of, variation of, modification of, 
replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
sequence providing the resultant nucleotide sequence has the ability to act as a terminator 

25 or codes for an amino acid sequence that has the ability to act as a signal sequence 
respectively in an expression system - such as the transformed cell or the transgenic 
organism according to the present invention. In panicular. the term "homologue " covers 
homology with respect to structure and/or function providing the resultant nucleotide 
sequence has the ability to act as or code for a terminator or signal respectively. With 

30 respect to sequence homology, preferably there is at least 75%, more preferably at least 
85%, more preferably at least 90% homology to SEQ ID NO.s 13 and 14 (respectively) 
shown in the anached sequence listings. More preferably there is at least 95%, more 
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preferably at least 98%, homology to SEQ ID NO.s 13 and 14 (respectively) shown in 
the attached sequence listings. 

The tenns "variant*', "homologue" or "fragment" in relation to the signal amino acid 
5 sequence include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) amino acid from or to the sequence providing 
the resultant sequence has the ability to act as a signal sequence in an expression system - 
such as the transformed cell or the transgenic organism according to the present 
invention. In particular, the tenn **homologue" covers homology with respect to strucnire 

10 and/or function providing the resultant nucleotide sequence has the ability to act as or 
code for a signal. With respect to sequence homology, preferably there is at least 75%, 
more preferably at least 85%, more preferably at least 90% homology to SEQ ID NO 15 
shown in the attached sequence listings. More preferably there is at least 95%, more 
preferably at least 98%, homology to SEQ ID NO 15 shown in the attached sequence 

15 listings. 

The above terms are synonymous with allelic variations of the sequences. 

The term "complementary" means that the present invention also covers nucleotide 
20 sequences that can hybridise to the nucleotide sequences of the coding sequence, the 
promoter sequence, the terminator sequence or the signal sequence respectively. 

The term "nucleotide" in relation to the present invention includes genomic DNA, 
CDNA, synthetic DNA, and RNA. Preferably it means DNA, more preferably cDNA 
25 for the coding sequence of the present invention since the genomic coding sequence has 
two inirons and their removal would allow expression in bacteria. 

The term "construct" - which is synonymous with terms such as "conjugate ", "cassette" 
and "hybrid" - includes a GOI directly or indirectly attached to a promoter. An example 
30 of an indirect attachment is the provision of a suitable spacer group such as an mtron 
sequence, such as the ShlAmron or the ADH mtron. intermediate the promoter and the 
GOI. The same is true for the term "fused" in relation to the present invention which 
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includes direct or indirect attachment. In each case, it is highly preferred that the terms 
do not cover the natural combination of the gene coding for the enzyme ordinarily 
associated with the wild type gene promoter and when they are both in their natural 
environment. A highly preferred embodiment is the or a GOI being operatively linked 
5 to a or the promoter. 

The construct may even contain or express a marker which allows for the selection of the 
genetic construct in, for example, a filamentous fungus, preferably of the genus 
Aspergillus, such as Aspergillus niger, or plants, preferably cereals, such as maize, rice, 
10 barley etc., into which it has been transferred. Various markers exist which may be 
used, such as for example those encoding mannose-6-phosphate isomerase (especially for 
plants) or those markers that provide for antibiotic resisunce - e.g. resistance to 0418, 
hygromycin, bleomycin, kanamycin and gentamycin. 

15 The term "vector" includes expression vectors and transformation vectors. 

The term "expression vector" means a construct capable of in vivo or in vitro expression. 

The term "transformation vector" means a consuruct capable of being transferred from 
20 one species to anodier - such as from an Exoli plasmid to a filamentous fungus, 
preferably of the genus Aspergillus. It may even be a construct capable of bemg 
transferred from an E,coli plasmid to an Agrobacterium to a plant. 

The term ''tissue" includes tissue per se and organ. 

25 

The term "organism* in relation to the present invention includes any organism that could 
comprise the promoter according to the present invention and/or the nucleotide sequence 
coding for the enzyme according to the present invention and/or products obtained 
therefrom, wherein the promoter can allow expression of a GOI and/or wherein the 
30 nucleotide sequence according to the present invention can be expressed when present m 
the orgamsm. 
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Preferably the organism is a filamentous fungus, preferably of the genus Aspergillus, 
more preferably Aspergillus niger. 

The term "transgenic organism" in relation to the present invention includes any organism 
5 that comprises the promoter according to the present invention and/or the nucleotide 
sequence coding for the enzyme according to the present invention and/or products 
obtained therefrom, wherein the promoter can allow expression of a GOI and/or wherein 
the nucleotide sequence according to the present invention can be expressed within the 
organism. Preferably the promoter and/or the nucleotide sequence is (are) incorporated 
10 in the genome of the organism. Preferably the transgenic organism is a filamentous 
fungus, preferably of the genus Aspergillus^ more preferably Aspergillus niger. 

Therefore, the transgenic organism of the present invention includes an organism 
comprising any one of, or combinations of, the promoter according to the present 

15 invention, the nucleotide sequence coding for the enzyme according to the present 
invention, constructs according to the present invention, vectors according to the present 
invention, plasmids according to the present invention, cells according to the present 
invention, tissues according to the present invention or the products thereof For example 
the u-ansgenic organism can comprise a GOI, preferably an exogenous nucleotide 

20 sequence, under the control of the promoter according to the present invention. The 
transgenic organism can also comprise the nucleotide sequence coding for the enzyme of 
the present invention under the control of a promoter, which may be the promoter 
according to the present invention. 

25 In a highly preferred embodiment, the transgenic organism does not comprise the 
combination of the promoter according to die present invention and the nucleotide 
sequence coding for the enzyme according to the present invention, wherein both the 
promoter and the nucleotide sequence are native to that organism and are in their natural 
environment. Thus, in these highly preferred embodiments, the present invention does 

30 not cover the native nucleotide coding sequence according to the present invention m its 
natural environment when it is under the control of its native promoter which is also in 
its namral environment. In addition, in this highly preferred embodiment, the present 
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invention does not cover the native enzyme according to the present invention when it is 
in its natural environment and when it has been expressed by its native nucleotide coding 
sequence which is also in its natural environment and when that nucleotide sequence is 
under the control of its native promoter which is also in its natural environment. 

5 

The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase 
binding site in the Jacob-Mond theory of gene expression. 

In one aspect, the promoter of the present invention is capable of expressing a GOI, 
10 which can be the nucleotide sequence coding for the enzyme of the present invention. 

In another aspect, the nucleotide sequence according to the present invention is under the 
control of a promoter that allows expression of the nucleotide sequence. In this regard, 
the promoter need not necessarily be the same promoter as that of the present invention. 
IS In this aspect, the promoter may be a cell or tissue specific promoter. If, for example, 
the organism is a plant then the promoter can be one that affects expression of the 
nucleotide sequence in any one or more of stem, sprout, root and leaf tissues. 

By way of example, the promoter for the nucleotide sequence of the present invention can 
20 be the a-Amy 1 promoter (otherwise known as the Amy 1 promoter, the Amy 637 
promoter or the a-Amy 637 promoter) as described in our co-pending UK patent 
application No, 9421292.5 filed 21 October 1994. That promoter comprises the sequence 
shown in Figure L 

25 Alternatively, the promoter for the nucleotide sequence of the present invention can be 
the a-Amy 3 promoter (otherwise known as the Amy 3 promoter, the Amy 351 promoter 
or the a-Amy 351 promoter) as described in our co-pending UK patent application No 
9421286.7 filed 21 October 1994. That promoter comprises the sequence shown in 
Figure 2. 

30 

Preferably, the promoter is the promoter of the present invention. 
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In addition to the nucleotide sequences described above, the promoters, panicularly that 
of the present invention, could additionally include features to ensure or to increase 
expression in a suitable host. For example, the features can be conserved regions such 
as a Pribnow Box or a TATA box. The promoters may even contain other sequences to 

5 affect (such as to maintain, enhance, decrease) the levels of expression of the GOI. For 
example, suitable other sequences include the 5/ii-intron or an ADH intron. Other 
sequences include inducible elements - such as temperature, chemical, light or stress 
inducible elements. Also, suitable elements to enhance transcription or translation may 
be present. An example of the latter element is the TMV 5' signal sequence (see Sleat 

10 Gene 217 [1987] 217-225; and Dawson Plant Mol. Biol. 23 [1993] 97). 

In addition the present invention also encompasses combinations of promoters and/or 
nucleotide sequences coding for proteins or enzymes and/or elements. For example, the 
present invention encompasses the combination of a promoter according to the present 
IS invention operatively linked to a GOI, which could be a nucleotide sequence according 
to the present invention, and another promoter such as a tissue specific promoter 
operatively linked to the same or a different GOI. 

The present invention also encompasses the use of promoters to express a nucleotide 
20 sequence coding for the enzyme according to the present invention, wherein a part of the 
promoter is inactivated but wherein the promoter can still function as a promoter. Partial 
inactivation of a promoter in some instances is advantageous. 

In panicular. with the Amy 351 promoter mentioned earlier it is possible to inactivate a 
25 part of it so that the panially inactivated promoter expresses GDIs in a more specific 
manner such as in just one specific tissue type or organ. 

The term "inactivated" means partial inactivation in the sense that the expression pattern 
of the promoter is modified but wherein the panially inactivated promoter still functions 
30 as a promoter However, as mentioned above, the modified promoter is capable of 
cxpressmg a GO! in at least one (but not all) specific tissue of the original promoter. 
One such promoter is the Amy 351 promoter described above. 
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Examples of panial inactivation include altering the folding pattern of the promoter 
sequence, or binding species to parts of the nucleotide sequence, so that a part of the 
nucleotide sequence is not recognised by, for example, RNA polymerase. Another, and 
preferable, way of partially inactivating the promoter is to truncate it to form fragments 
5 thereof. Another way would be to mutate at least a part of the sequence so that the RNA 
polymerase can not bind to that part or another part. 

Another modification is to mutate the binding sites for regulatory proteins for example 
the CreA protein known from filamentous iiingi to exert carbon catabolite repression, and 
10 thus abolish the catabolite repression of the native promoter. 

The term "GOF with reference to the present invention means any gene of interest. A 
GOI can be any nucleotide that is either foreign or natural to the organism (e.g. 
filamentous fungus, preferably of the genus Aspergillus, or a plant) in question. Typical 

IS examples of a GOI include genes encoding for proteins and enzymes that modify 
metabolic and caubolic processes. The GOI may code for an agent for introducing or 
increasing pathogen resistance. The GOI may even be an antisense construct for 
modifying the expression of namral transcripts present in the relevant tissues. The GOI 
may even code for a non-natural protein of a filamentous fungus, preferably of the genus 

20 Aspergillus, or a compound that is of benefit to animals or hunfians. 

For example, the GOI could code for a pharmaceutically active protein or enzyme such 
as any one of the therapeutic compounds insulin, interferon, human serum albumin, 
human growth factor and blood clotting factors. In this regard, the transformed cell or 

25 organism could prepare acceptable quantities of the desired compound which would be 
easily retrievable from, the cell or organism. The GOI may even be a protein giving 
nutritional value to a food or crop. Typical examples include plant proteins that can 
inhibit the formation of anti-nutritive factors and plant proteins that have a more desirable 
amino acid composition (e.g. a higher lysine content than a non-transgenic plant). The 

30 GOI may even code for an enzyme that can be used in food processing such as chymosm, 
thaumatin and ct-galactosidase The GOI can be a gene encoding for any one of a pest 
toxin, an antisense aanscripc such as that for patatm or a-amylase, ADP-glucose 
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pyrophosphorylase (e.g. see EP-A-0455316), a protease antisense or a glucanase. 

The GOI can be the nucleotide sequence coding for the ct-amylase enzyme which is the 
subject of our co-pending UK patent application 9413439.2 filed on 4 July 1994, the 
5 sequence of which is shown in Figure 3 . The GOI can be the nucleotide sequence coding 
for the of-amylase enzyme which is the subject of our co-pending UK patent application 
9421290.9 filed on 21 October 1994. the sequence of which is shown in Figure 4. The 
GOI can be any of the nucleotide sequences coding for the ADP-glucose 
pyrophosphorylase enzymes which are the subject of our co-pending PCT patent 
10 application PCT/EP94/01082 filed 7 April 1994, the sequences of which are shown in 
Figures 5 and 6. The GOI can be any of the nucleotide sequences coding for the a- 
glucan lyase enzyme which are described in our co-pending PCT patent application 
PCT/EP94/03397 filed 15 October 1994, the sequences of which are shown in Figures 
7-10. 

15 

In one preferred embodiment, the GOI is a nucleotide sequence coding for the enzyme 
according to the present invention. 

As mentioned above, a prefened host organism is of the genus Aspergillus, such as 
20 Aspergillus niger. 

The transgenic Aspergillus according to the present invention can be prepared by 
following the teachings of RambosekJ. and LeachJ. 1987 (Recombinant DNA in 
filamentous fungi: Progress and Prospects. CRC Crit. Rev. Biotechnol. 6:357-393), 

25 Davis R.W. 1994 (Heterologous gene expression and protein secretion in Aspergillus. In: 
Martinelli S.D. , Kinghom J.R.( Editors) Aspergillus: 50 years on. Progress in industrial 
microbiology vol 29. Elsevier Amsterdam 1994. pp 525-560), Ballance.D.J. 1991 
(Transformation systems for Filamentous Fungi and an Overview of Fungal Gene 
structure. In :Leong.S. A., Bcrka R.M. (Editors) Molecular Industrial Mycology. Systems 

30 and Applications for Filamentous Fungi. Marcel Dekker Inc. New York 1991. pp 1-29) 
and Turner G. 1994 (Vectors for genetic manipulation. In: Manmelli S.D., Kmghom 
J.R.( Editors) Aspergillus: 50 years on. Progress in industrial microbiology vol 29 
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Elsevier Amsterdam 1994. pp. 641-666). However, the following commentary provides 
a summary of those teachings for producing transgenic Aspergillus according to the 
present invention. 

5 Filamentous fungi have during almost a century been widely used in industry for produc- 
tion of organic compounds and enzymes. Traditional Japanese koji and soy fermentations 
have used Aspergillus sp. for hundreds of years. In this century Aspergillus niger has 
been used for production of organic acids panicular citric acid and for production of 
various enzymes for use in industry. 

10 

There are two major reasons for that filamentous fungi have been so widely used in 
industry. First filamentous fungi can produce high amounts of extracellular products, for 
example enzymes atvd organic compounds such as antibiotics or organic acids. Second 
filamentous fimgi can grow on low cost substrates such as grains, bran, beet pulp etc. 
IS The same reasons have made filamentous fungi attractive organisms as hosts for 
heterologous expression according to the present invention. 

In order to prepare the transgenic Aspergillus, expression constructs are prepared by 
inserting a GOI (such as an amylase or SEQ. I.D. No. 2) into a construct designed for 
20 expression in filamentous fungi. 

Several types of constructs used for heterologous expression have been developed. The 
constructs contain the promoter according to the present invention (or if desired another 
promoter if the GOI codes for the enzyme according to the present invention) which is 

25 active in fungi. Examples of promoters other than that of the present invention include 
a fungal promoter for a highly expressed extracellulary enzyme, such as the glucoamylase 
promoter or the a-amylase promoter. The GOI can be fused to a signal sequence (such 
as that of the present invention or another suitable sequence) which directs the protein 
encoded by the GOI to be secreted. Usually a signal sequence of fungal origin is used. 

30 such as that of the present invention. A terminator active in fungi ends the expression 
system, such as that of the present invention. 



10 
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Another type of expression system has been developed in ftingi where the GOI is fused 
to a smaller or a larger part of a ftmgal gene encoding a stable protein. This can stabilize 
the protein encoded by the GOI. In such a system a cleavage site, recognized by a 
specific protease, can be introduced between the fungal protein and the protein encoded 
by the GOI, so the produced fusion protein can be cleaved at this position by the specific 
protease thus liberating the protein encoded by the GOI ("POI"). By way of example, 
one can introduce a site which is recognized by a KEX-2 like peptidase found in at least 
some Aspergilli. Such a fusion leads to cleavage in vivo resulting in protection of the 
POI and production of POI and not a larger fusion protein. 



Heterologous expression in Aspergillus has been reported for several genes coding for 
bacterial, fungal, vertebrate and plant proteins. The proteins can be deposited 
intracellularly if the GOI is not fused to a signal sequence. Such proteins will accumulate 
in the cytoplasm and will usually not be glycosylated which can be an advantage for some 
15 bacterial proteins. If the GOI is equipped with a signal sequence the protein will 
accumulate extracellulary. 



With regard to product stability and host strain modifications, some heterologous proteins 
are not very stable when they are secreted into the culture fluid of fungi. Most fungi 
20 produce several exu^cellular proteases which degrade heterologous proteins. To avoid 
this problem special fungal strains with reduced protease production have been used as 
host for heterologous production. 

For the uansformation of filamentous fungi, several transformation protocols have been 
25 developed for many filamentous fungi (Ballance 1991, ibid). Many of them are based 
on preparation of protoplasts and introduction of DN A into the protoplasts using PEG and 
Ca-" ions. The transformed protoplasts then regenerate and the transformed fungi are 
selected using various selective markers. Among the markers used for transformation are 
a number of auxotrophic markers such as argB. (rpC, nioD and pyrG, antibiotic 
30 resistance markers such as bcnomyl resistance, hygromycin resistance and phleomycin 
resistance. A very common used transformaiion marker is the amdS gene of A. mdulans 
which in high copy number allows the fungus to grow with acr\'lamide as the sole 
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nitrogen source. 

Even though the enzyme, the nucleotide sequence coding for same and the promoter of 
the present invention are not disclosed in EP-B-0470145 and CA-A-2006454, those two 
5 documents do provide some useful background commentary on the types of techniques 
that may be employed to put the present invention into practice. Some of these 
background teachings are now included in the following conunentary. 

The basic principle in the construction of genetically modified plants is to insert genetic 
10 information in the plant genome so as to obtain a stable maintenance of the insened 
genetic material. Several techniques exist for inserting the genetic information, the two 
main principles being direct introduction of the genetic information and introduction of 
the genetic information by use of a vector system. 

15 A review of the general techniques tnay be fouiKl in articles by Potrykus ( Annu Rev Plant 
Physiol Plant Mol Biol [1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech 
March/ April 1994 17-27). 

Thus, in one aspect, the present invention relates to a vector system which carries a 
20 promoter or nucleotide sequence or construct according to the present invention and 
which is capable of introducing the promoter or nucleotide sequence or construct into the 
genome of an organism, such as a plant. 

The vector system may comprise one vector, but it can comprise two vectors. In the case 
25 of two vectors, the vector system is normally referred to as a binary vector system. 
Binary vector systems are described in further detail in Gynheung An et al. (1980). 
Binary Vectors, Plant Molecular Biology Manual A3, 1-19. 

One extensively employed system for transformation of plant cells with a given promoter 
30 or nucleotide sequence or construct is based on the use of a Ti plasmid from 
Agrobactenum tumefaciens or a Ri plasmid from Agrobaaehum rhizogenes An ei al 
(1986), Plant Physiol, 81, 301-305 and Butcher D.N, et al. (1980), Tissue Culture 
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Methods for Plant Pathologists, eds.: D.S. Ingrams and J. P. Helgeson, 203-208. 

Several different Ti and Ri plasmids have been constructed which are suitable for the 
construction of the plant or plant cell constructs described above. A non-limiting 
example of such a Ti plasmid is pGV3850, 

The promoter or nucleotide sequence or construct of the present invention should 
preferably be inscned into the Ti-plasmid between the terminal sequences of the T-DNA 
or adjacent a T-DNA sequence so as to avoid disruption of the sequences immediately 
surrounding the T-DNA borders, as at least one of these regions appear to be essential 
for insertion of modified T-DNA into the plant genome. 

As will be understood from the above explanation, if the organism is a plant, then the 
vector system of the present invention is preferably one which contains the sequences 
necessary to infect the plant (e.g. the v/r region) and at least one border part of a T-DNA 
sequence, the border part being located on the same vector as the genetic construct. 

Furthermore, the vector system is preferably an Agrobacterium tumefaciens Ti-plasmid 
or an Agrobacterium rhizogenes Ri-plasmid or a derivative thereof, as these plasmids are 
well-known and widely employed in the construction of transgenic plants, many vector 
systems exist which are based on these plasmids or derivatives thereof. 

In the construction of a uansgenic plant the promoter or nucleotide sequence or construct 
of the present invention may be first constructed in a microorganism in which the vector 
can replicate and which is easy to manipulate before insertion into the plant. An example 
of a useful microorganism is £. coU, but other microorganisms having the above 
propenies may be used. When a vector of a vector system as defined above has been 
constructed in £. coli, it is transferred, if necessary, into a suitable Agrobacterium strain, 
e.g. Agrobacterium tumefaciens. The Ti-plasmid harbouring the promoter or nucleotide 
sequence or construct of the invention is thus preferably transferred into a suitable 
Agrobacterium strain, e.g. A. tumefaciens. so as to obtain an Agrobacterium cell 
harbouring the promoter or nucleotide sequence or construct of the mvention, which 
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DNA is subsequently transferred into the plant cell to be modified. 

As reported in CA-A-2006454, a large amount of cloning vectors are available which 
contain a replication system in E. coli and a marker which allows a selection of the 
5 transformed cells. The vectors contain for example pBR 322, pUC series, M13 mp 
series, pACYC 184 etc. In such a way, the nucleotide or construct or promoter of the 
present invention can be introduced into a suitable restriction position in the vector. The 
contained plasmid is used for the transformation in E.coU. The £. coli cells are cultivated 
in a suitable nutrient medium and then harvested and lysed. The plasmid is then 

10 recovered. As a method of analysis there is generally used sequence analysis, restriction 
analysis, electrophoresis and further biochemical-molecular biological methods. After 
each manipulation, the used DNA sequence can be restricted and connected with the-next 
DNA sequence. Each sequence can be cloned in the same or different plasmid. After 
each introduction method of the desired promoter or construct or nucleotide sequence 

15 according to the present invention in the plants the presence and/or insenion of funher 
DNA sequences may be necessary. If, for example, for the transformation the Ti- or Ri- 
plasmid of the plant cells is used, at least the right boundary and often however the right 
and the left boundary of the Ti- and Ri-plasmid T-DNA. as flanking areas of the 
introduced genes, can be connected. The use of T-DNA for the transformation of plant 

20 cells has been intensively studied and is described in EP-A-120S16; Hoekema, in: The 
Binary Plant Vector System Offset-dmkkerij Kanters B.B., Alblasserdam, 1985. Chapter 
V; Fraley, et al., Crit. Rev. Plant Sci., 4:1-46; and Anet al., EMBO J. (1985) 4:277- 
284. 

25 Direct infection of plant tissues by Agrobaaerium is a simple technique which has been 
widely employed and which is described in Butcher D.N, et al. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J.P. Heigeson, 203-208. For 
funher teachings on this topic see Potrykus (Annu Rev Plant Physiol Plant Mol Biol 
[ 199 1] 42:205-225) and Christou ( Agro-Food.Industr>' Hi-Tech March/ April 1994 1 7-27) . 

30 With this technique, infection of a plant may be done on a certain part or tissue of the 
plant, i.e. on a pan of a leaf» a root, a stem or another pan of the plant. 
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Typically, with direct infection of plant tissues by Agrobacterium carrying the promoter 
and/or the GOI, a plant to be infected is wounded, e.g. by cutting the plant with a razor 
or puncturing the plant with a needle or rubbing the plant with an abrasive. The wound 
is then inoculated with the Agrobacterium. The inoculated plant or plant pan is then 
5 grown on a suitable culture medium and allowed to develop into mature plants. 

When plant cells are constructed, these cells may be grown and maintained in accordance 
with well-known tissue culcuring methods such as by culturing the cells in a suitable 
culture medium supplied with the necessary growth factors such as amino acids, plant 
10 hormones, vitamins, etc. Regeneration of the transformed cells into genetically modified 
plants may be accomplished using known methods for the regeneration of plants from cell 
or tissue cultures, for example by selecting transformed shoots using an antibiotic and by 
subculturing the shoots on a medium containing the appropriate nutrients, plant 
hormones, etc. 

15 

Further teachings on plant transformation may be found in EP-A-0449375. 



In summation, the present invention provides a glucanase enzyme and the nucleotide 
sequence coding for the same. In addition, it provides a promoter that can control the 
20 expression of that, or another, nucleotide sequence. In addition it includes terminator and 
signal sequences for the same. 



The following sample was deposited in accordance with the Budapest Treaty at the 
recognised depositary The National Collections of Industrial and Marine Bacteria Limited 
25 (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, United Kingdom, AB2 IRY on 
16 January 1995: 

Exoli containing plasmid pEGLA-3 {i.e. E.coli DH5a-pEGLA-3} . The deposit 
number is NCIMB 40704. 

30 

The present invention will now be described by way of example. 
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In the following Examples reference is made to the accompanying Figures in which 



Figures 1-10 are sequences of promoters and GOIs of earlier patent applications that are 
useful for use with the aspects of the present invention; 

5 

Figure 11 is a plasmid map of plasmid pEGLA-3; 
Figure 12 is a schematic diagram of some promoter deletions; 
10 Figure 13 is a plasmid map of pGPAMY; 
Figure 14 is a graph; 

Figure 15 is a plasmid map of pGP-GssAMY-Hyg; 

15 

Figure 16 is a graph; and 

Figure 17 is a Western Blot. 

20 The following Examples discuss recombinant DNA techniques. General teachings of 
recombinant DNA techniques may be found in Sambrook.J., Fritsch, E.F., Maniatis T. 
(Editors) Molecular Cloning. A laboratory manual. Second edition. Cold Spring Harbour 
Laboratory Press. New York 1989. 

25 Purification of the /3-glucanase 

Aspergillus niger 3M43 was grown in medium containing wheat bran and beet pulp. The 
fermentation broth was separated from the solid part of the broth by filtration. 
Concentrated fermentation broth was then loaded on a 25X100mm Q-SEPHAROSE 
30 (Pharmacia) high Performance column, equilibrated with 20 mM Tris. HCl pH 7.5. and 
a linear gradient from 0-500 mM NaCl was performed and fractions of the eluaie was 
collected. The i3-glucanasc eluted at ca 100 mM NaCl. The fractions containing 
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glucanase were combined and desalted using a 50x200 mm G-25 SEPHAROSE Superfine 
(Pharmacia). The column was then eluted with distilled water. After desalting the 
enzyme was concentrated using High-Trap spin columns. 

5 Next the concentrated and desalted fractions were subjected to gel filtration on a 50x600 
mm SUPERDEX 50 column. The sample was loaded and the column was eluted with 
0.2 M Phosphate buffer pH 7.0 plus 0.2 M NaCl, and fractions of the eluate were 
collected. The fractions containing glucanase were combined and desalted and 
concentrated as described above. 

10 

The combined fractions were loaded on a 16X100 mm PhenylSEPHAROSE High 
Performance column (Pharmacia), equilibrated with 50 mM Phosphate buffer pH'6,0, 
containing 1.5 M (NH4)2S04. A gradient where the (NH4)2S04 concentration was varied 
from 1.5 - 0 M was applied and the eluate collected in fractions. The fractions 
15 containing glucanase were combined. The purity of the /3-1,4-glucanase was evaluated 
SDS-PAGE using the Phast system gel (Pharmacia). 

Characterization 

20 The molecular weight of the purified glucanase was determined by mass spectrometry 
using laser desorption technology. The MW of the glucanase was found to be 24.235 D 
± 50 D. 

The pi value was determined by use of a Broad pi Kit (Pharmacia). The glucanase has 
25 a pi value of about 4. 

After SDS-PAGE analysis, u-caimcnt PAS reagent showed diat the glucanase is not 
glycosylated. The PAS staining was done according to the procedure of I. Van- 
Seuningcn and M. Davril (1992) Electrophoresis 13 pp 97-99. 

30 
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Amino acid sequencing of the jS-glucanase 

The enzyme was digested with endoproteinase Lys-C sequencing grade from Boehringer 
Mannheim using a modification of the method described by Stone & Williams 1993 
5 (Stone, K.L. and Williams, K.R. (1993). Enzymatic digestion of Proteins and HPLC 
Peptide Isolaiion. in ; Matsudaira P. (Editor). A practical Guide to Protein and Peptide 
Purification for Microsequencing. Second Edition. Academic Press, San Diego 1993. pp 
45-73). 

10 Freeze dried iS-glucanase (0,4 mg) was dissolved in 50 /xl of 8M urea, 0.4 M NH4HCO3, 
pH 8.4. After overlay with Nj and addition of 5 /xl of 45 mM DTT, the protein was 
denatured and reduced for 15 min at 50^C under Nj. After cooling to RT, 5 ^1 of 100 
mM iodoaceiamide was added for the cysteines to be derivatised for 15 min at RT in the 
dark under N2. Subsequently. 90 ^1 of water and 5 /xg of endoproteinase Lys-C in 50 fil 

15 of 50 mM Tricine and 10 mM EDTA, pH 8.0, was added and the digestion was carried 
out for 24h at 37^C under Nj. The resulting peptides were separated by reversed phase 
HPLC on a VYDAC C18 column (0.46 x 15 cm: 10 (im: The Separations Group; 
California) using solvent A: 0.1% TFA in water and solvent B: 0.1% TFA in 
acetonitrile. Selected peptides were rechromatographed on a Dcvelosil CIS column (0.46 

20 X 10 cm; 3/xm) using the same solvent system prior to sequencing on an Applied 
Biosystems 476A sequencer using pulsed-liquid fast cycles. 

The following peptide sequences were found: 

25 SEQ I.D. No. 4 
SEQI.D. No. 5 
SEQ I.D. No. 6 
SEQ I.D. No. 7 
SEQ I.D, No. 8 
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Isolation of a PCR clone of a fragment of the gene 
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PCR primers were synthesised using an Applied Biosysiems DNA synthesiser model 392. 
In this regard. PCR primers were synthesized from two of the found peptide sequences, 
WEVWYGT from Seq I.D. No. 4 and WTWSGG from Seq I.D. No. 7. The primer 
derived from WEVWYGT (reversed^ is shown as Seq I.D. No. 9 and the primer derived 
from WTWSGG is shown as Seq I.D. No. 10 - see below: 

SEQ. I.D. No. 10 

TGG ACN TGG WSN GGN GG 

17 mcr 256 mixmre 

SEQ. I.D, No. 9 

CTN CCR TAG CAN ACY TCC CA 
20 mer 64 mixture 

PCR amplification was performed with 100 pmol of each of these primers in 100 /xl 
reactions using the Amplitaq II kit (Perkin Elmer). The program was: 

STEP TEMP TIME 



1 


94'C 


2 min 


2 


94"'C 


1 min 


3 


55'C 


2 min 


4 


72'C 


2 min 


5 


72"C 


S min 


6 




SOAK 



Steps 2-4 were repeated for 40 cycles. 

The program was run on a PERKIN ELMER DNA Thermal Cyder. 
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A 350 bp amplified fragment was isolated and cloned into a pT7-Blue T-vector according 
to the manufacturer's instructions (Novagen). A fragment was isolated and sequenced. 
The found sequence showed that it was indeed a part of the glucanase gene. 

5 Isolation of A. niger genomic DNA 

Ig. of frozen A. niger mycelium was ground in a mortar under liquid nitrogen. Follow- 
mg evaporation of the nitrogen cover, the ground mycelium was extracted with 15ml of 
an extraction buffer (lOOmM Tris-HCl. pH 8.0, O.SOmM EDTA, 500mM NaCl, lOmM 

10 B-mercaptoethanol) containing Iml 20% sodium dodecyl sulphate. After incubation at 
65^C for 10 min. 5ml 5M KAc. pH 5.0, was added and the mixture funher incubated, 
after mixing, on ice for 20 mins. The mixture was then ccmrifligcd for 20 mins. and the 
supernatant mixed with 0.6 vol. isopropanoi to precipiute the extracted DNA. After 
further centriftigaiion for 15 mins. the DNA pellet was dissolved in 0.7 ml TE (lOmM 

15 Tris, HCl pH 8.0, ImM EDTA) and precipitated with 75 ^1 3M NaAc. pH 4.8, and 500 
ixl isopropanoi. After centrifugation die pellet was washed with 70% ETOH and dried 
under vacuum. The DNA was dissolved in 200 fil TE and stored at -20°C. 

Construction of a library 

20 

20 genomic DNA was panly digested with Tsp509I, which gives ends which are 
compatible with EcoRI ends. The digested DNA was separated on a 1 % agarose gel and 
fragments of 4-10 kb was purified. A XZAPII EcoRI/CIAP kit from Stratagenc was used 
for library construction according to the manufacnirers instructions. 2 ^1 of the ligation 
25 (totally 5 ^1) was packed with Gigapack Gold II packing extract according to the 
manufacturer's instuctions (Stratagene). The library contained 650.000 independent 
clones. 
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Screening of the library 

2 X 50.000 pfu was plated on NZY plates (5g NaCI, 2mg MgSO^.TH.O, 5g yeast extract, 
lOg casein hydrolysate, 15 g agar per liter) and plaquelifts were done on Hybond N 

5 sheets (Amcrsham). The sheets were hybridized with the PGR clone labelled with ^-P 
dCTP (Amersham) using Ready-to-go labelling kit from Pharmacia. The plaquelifts and 
hybridization were done in duplicate and positive clones were reckoned only when 
hybridization could be detected on both sheets. The nucleotide sequence of the present 
invention was sequenced using a ALF-laser fluorescence sequencer (Pharmacia). The 

10 sequence contained all the found amino acid sequence, confirming that the isolated gene 
indeed encoded the /}-l,4-endoglucanase. 

Sequence information 

15 SEQ, ID. No. 12 presents the promoter sequence, the enzyme coding sequence, the 
terminator sequence and the signal sequence and the amino acid sequence of the enzyme 
of the present invention. 

Testing enzyme activity 

20 

The purified protein was assayed for endo 0-1 A glucanase activity using Azurine- 
crosslinked barley /3-glucan tablet (trade name: Glucazyme tablets supplied by Megazy- 
me. Australia) by the insttxictions given by the manufacmrer. 

25 The purified enzyme gave a high activity on this substrate. Typically the enzyme has a 
specific activity of 2250 micromol glucose per min per mg of protein. 

Induction of the EglA gene; identification of inducing carbon source 



30 



The Table below shows the identification of a number of high and low molecular weight 
inducers of the glucanase promoter. This analysis was carried out using the full length 
glucanase promoter of the present invention fused to the £ coli /J-glucuronidase gene 
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The inducing strength of different carbon sources was determined quantitatively by 
measuring the intracellular GUS specific activity to hydrolyse p-nitrophcnol giucuronide. 



CARBON SOURCE 

(1%) 



GUS ACTIVITY 

(units/mg)- 24 hours 



10 



15 



20 



25 



xylose 

xyliiol 

arabinose 

arabitol 

glucose 

cellubiose 

xylo-oiigomer 70 

glucopyranoside 

methyl-xylopyranoside 

xyloglucan 

pectin 

arabinogalactan 
arabitol + glucose 



12.91 

10.62 

8.50 

14.40 

20.25 

19.44 

11.80 

19.70 

12.60 

13,90 

9.70 

30.20 

29.50 



Surprisingly glucose, which is normally a potent catabohte repressor, induces the 
giucanase promoter. 

Accordingly, the present invention also relates to a promoter diat is inducible by glucose. 
In addition, the present invention relates to the use of glucose to induce a promoter. 



These aspects of the present invention are different to the teachings of WO 94/04673 
which discloses a fungal promoter that is active in the presence of glucose. In this 
30 regard, the promoter of the present invention is not only active in the presence of glucose 
but that it is also inducible by glucose. 
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One of the advantages of having a glucanase promoter that is inducible by glucose is that 
the promoter can be used to express a GOI. and thereby be used to prepare a POI (such 
as an heterologous POI). in a glucose containing environment. This is important because 
glucose is one of preferred carbon sources for biomass accumulation. In addition, 

5 glucose containing media are expected to produce lower amounts of proteases, thereby 
providing better yields of the POI. In addition, the use of a derepressed promoter in a 
derepressed host strain will increase the speed and efficiency of reaction media, such as 
a fermentation reaction medium. In addition, the use of mixed carbon sources during 
fermentation will allow the efficient induction of this promoter, for example at low levels 

10 of glucose and a cheap carbon source (e.g. sugar beet pulp). 

Effects of promoter deletions on the regulation of the expression of the glucanase 
gene 

IS A series of deletion studies, which are shown in Figure 12, were performed. In these 
studies, the different promoter deletion constructs shown in Figure 12 were fused to the 
GUS gene. The activity of the reporter gene was assayed qualitatively. The results 
showed that none of the deletions abolished the inducibility of the glucanase promoter. 
These results indicate the presence of multiple sites for transcriptional activation and 

20 initiation of transcription. 

HETEROLOGOUS PR OTEIN PRODUC TION USING TRANSFQRMANTS OF 
ASPERGILLUS NIGER COMPRISING THE GLUCANASE PROMOTER (GP^ AND 
THE QLVCANASE 3K?N AL S E QWNCE (Qss) 

25 

Transformation of Aspen^llus Niser 

The protocol for transformation of A. niger was based on the teachmgs of Buxton,F.P.. 
Gwymie D.I., Davis.R.W. 1985 (Trznsfommion of Aspergillus nigernsiug ihtargB gene 
30 of Aspergillus nidulans. Gene 37:207-214). Daboussi.M.J., Djeballi.A.. Gerlinger. C, 
Blaiseau. PL.. Cassan. M., Lebrun. M.H.. Parisot, D.. Brygoo.Y 1989 
(Transformation of seven species of filamentous fungi using the nitrate reductase gene of 
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Aspergillus nidulans, Curr, Genei. 15:453-456) and Punt, P.J.. van den Hondel. 
C. A.M. J. J. 1992 (Transformaiion of filamentous fungi based on hygromycin B and 
Phleomycin resistance markers. Meth. Enzym. 216:447-457). 

5 For the purification of protoplasts, spores from one PDA (Potato Dextrose Agar - from 
Difco Lab. Detroit) plate of fresh sporulaied N400 (CBS 120.49. Ceniraalbureau voor 
Schimmclcul cures. Baam) (7 days old) are washed off in 5-10 ml water. A shake flask 
with 200 ml Potato Dextrose Broth (difco 0549-17-9, Difco Lab. Detroit) is inoculated 
with this spore suspension and shaken (250 rpm) for 16-20 hours at 30°C. 

10 

The mycelium is harvested using Miracloih paper and 3-4 g wet mycelium are transferred 
to a sterile pctri dish with 10 ml STC (1.2 M sorbitol, 10 mM Tris HCl pH 7,5, 50^mM 
CaCU) with 75 mg lysing enzymes (Sigma L-2265) and 4500 units iyticase (Sigma L- 
8012). 

15 

The mycelium is incubated with the enzyme until the mycelium is degraded and the 
protoplasts are released. The degraded mycelium is then filtered through a sterile 60 pltxi 
mesh filler. The protoplasts are harvested by centrifugation 10 min at 2(X)0 rpm in a 
swing out rotor. The supernatant is discarded and the pellet is dissolved in 8 ml 1.5 M 
20 MgSO^ and then centriftiged at 3000 rpm for 10 min. 

The upper band* containing the protoplasts is transferred to another tube, using a transfer 
pipette and 2 ml 0.6 M KCl is added. Carefully 5 ml 30% sucrose is added on the top 
and the mbe is centrifuged 15 min at 3000 rpm. 

25 

Tne protoplasts, lying in the interface band, are transferred to a new mbe and diluted 
with 1 vol. STC. The solution is centrifuged 10 min at 3000 rpm. The pellet is washed 
twice with STC, and fmally solubilizcd in 1 ml STC. The protoplasts are counted and 
evenmally concentrated before transformation. 



30 
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For the transformation, 100 ^1 protoplast solution (10*-10- protoplasts) are mixed with 
10 /xl DNA solution containing 5- 10 DNA and incubated 25 min at room 
temperature. Then 60 % PEG-4000 is carefully added in portions of 200 ^1, 200 /xl and 
800 /xl. The mixture is incubated 20 min at room temperature. 3 ml STC is added to the 
mixnire and carefully mixed. The mixture is centrifugated 3000 rpra for 10 min. 

The supernatant is removed and the protoplasts are solubilized in the remaining of the 
supernatant. 3-5 ml topagarose is added and the protoplasts are quickly spread on 
selective plates. 

Glucanase promoter and heterologous gene expression 

Figure 13 shows the expression vector pGPAmy that was used in these studies. This 
expression vector comprises the glucanase promoter fused to the Thermomyces 
lanuginosus precursor form of the a-amylase gene. Transcription from the promoter is 
terminated using the xylanase A terminator. This construct was used in a co- 
transformation experiment with the hygromycin resistance gene as the selectable marker. 

The production of a-amylase using four independent transformams containing the 
expression vector pGPAmy when grown on sugar beet pulp and wheat bran is shown in 
Figure 14. The a-amylase activity was first detected in the culture medium after 48 
hours of growth. A peak of enzyme activity was observed after days 3 and 4. 

Glucanase signal sequence & heterologous protein production 

For these smdies. the expression vector pGPGssAmyHyg was used. 

The vector pGPGssAmyHyg is shown in Figure 15 This vector comprises the mature 
a-amylase gene translationally fused to the glucanase signal pepude (labelled ss) In 
addition, this vector comprises the promoter of the present invention (labelled EGl A) 
and the xylanase A terminator. Transcription from this vector is therefore under the 
control of the glucanase promoter and termination by the xylanase A terminator. 
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This construct was used to test inter alia the efficiency of the signal peptide in 
heterologous protein secretion. 

Figure 16 shows the results of the induciion of a-amylase by use of the construct in strain 
5 6M179 when grown in sugar beet pulp/ wheat bran. The results show that (he enzyme 
activity was localised in the culture medium and was first detected after 48 hours of 
growth. Accumulation of enzyme activity was observed at day 4. 

Western Blot 

10 

Figure 17 shows a Western blot of proteins from the supernatant of three independent 
transformants separated by SDS-PAGE and blotted to a membrane. A synthetic peptide 
with 15 amino acid residues of T lanuginosus a-amylasc recognised a single band on 
Western blots of culture supemaiants from the transformants. 

15 

Antibody Production 

Antibodies were raised against the enzyme of the present invention by injecting rabbits 
with the purified enzyme and isolating the immunoglobulins from antiserum according 
20 to procedures described according to N Harboe and A Ingild Clmmuruzation, Isolation 
of Inununoglobulins. Estunation of Antibody Titre" In A Manual of Quantitative 
Immunoelectrophoresis, Methods and Applications, N H Axelsen, et al (eds.). 
Universitetsforlagei, Oslo. 1973) and by T G Cooper ( 'The Tools of Biochemistry". John 
Wiley & Sons, New York, 1977). 

25 

SUMMARY 

Even though it is known that Aspergillus niger produces several enzymes which can 
degrade i3-glucan. the present invention provides a novel and inventive 0-1,4- 
30 endoglucanase. as well as the coding sequence therefor, the termination sequence 
therefor, the signal sequence therefor, and the promoter for those sequences. An 
imponani advantage of the present invention is that the enzyme can be produced m high 
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amounts. In addition, the promoter and the regulatory sequences ('such as the signal 
sequence and the terminator) can be used to express or can be used in the expression of 
GOIs in organisms, such as in ^4, niger. 

The enzyme of the present invention is advantageous for feed supplements. In addition, 
it can be used in the brewing industry as it has a high fibre-conversion potential. In 
addition, there are fewer processing problems when the enzyme is used, panicularly with 
non-starchy polysaccharides. In addition, the enzyme efficiently degrades jS-glucans, 
therefore it can be used advantageously in the brewing industry to lower viscosity and 
also improve the filierability of beer. This is important as large molecular weight glucans 
in beer and the like can cause filtration difficulties and give rise to sediments, gels and 
hazes. 



The signal sequence of the present invention is useful for secretion of a POI. such as a 
heterologous POI. thereby improving the quality and quantity of the POI. 

Other modifications of the present invention will be apparent to those skilled in the an 
without departing from the scope of the invention. 
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SEQUENCE INFORMATION 

ENZYME SEQUENCE 
SEQ ID NO: 1: 



Gin Thr Met Cys Ser Gin Tyr AsD Ser Ala Ser Sen Pro Pro "^yr 5er 

Id 10 15 

Val Asn Gin Asn Leu Tro Gly Glu Tyr Gin Gly Thr Gly Ser Gin Cys 

20 25 30 

Val Tyr Val Asp Lys Leu Ser Ser Ser Gly Ala Ser Trp His Thr Lys 

35 40 45 

Trp Thr Trp Ser Gly Gly Glu Gly Thr Val Lys Ser Tyr Ser Asn Ser 

50 55 60 

Gly Leu Thr Phe Asp Lys Lys Leu Val Ser Asp Val Ser Ser He Pro 
65 70 75 80 

Thr Ser Val Thr Trp Ser Gin Asp Asp Thr Asn Val Gin Ala Asp Val 

85 90 95 

Ser Tyr Asp Leu Phe Thr Ala Ala Asn Ala Asp His Ala Thr Ser Ser 

100 105 110 

Gly Asp Tyr Glu Leu Met He Trp Leu Ala Arg Tyr Gly Ser Val Gin 

115 120 125 

Pro He Gly Lys Gin He Ala Thr Ala Thr Val Gly Gly Lys Ser Trp 

130 135 140 

Glu Val Trp Tyr Gly Thr Ser Thr Gin Ala Gly Ala Glu Gin Lys Thr 
145 150 155 160 

Tyr Ser Phe Val Ala Gly Ser Pro He Asn Ser Trp Ser Gly Asp He 

165 170 175 

Lys Asp Phe Phe Asn Tyr Leu Thr Gin Asn Gin Gly Phe Pro Ala Ser 

180 185 190 

Ser Gin His Leu He Thr Leu Gin Phe Gly Thr Glu Pro Phe Thr Gly 

195 200 205 

Gly Pro Ala Thr Phe Thr Val Asp Asn Trp Thr Ala Ser Val Asn * 
210 215 220 
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ENZYME COOING SEQUENCE 
SEQ ID NO: 2: 

CAG ACG ATG TGC iCT CAG TAT GAC AGT GCC TCG AGC CCC CCA TAC TCG 
3TG AAC CAG AAC CTC TGG GGC GAA TAC CAG GGC ACT GGC AGC CAG TGT 
GTC TAC 3TC GAC AAC CTT AGC AGC AGT GGT GCC TCA TGG CAT ACC AAA 
TGG ACC TGG AGT GGT GGC GAG GGA ACA GTG AAA AGC TAC TCT AAC TCC 
GGC err ACG TTT GAC AAG AAG CTA GTC AGC GAT GTG TCA AGC ATT CCC 
ACC TCG GTG ACA TGG AGC CAG GAC GAC ACC AAT GTC CAA GCC GAT GTC 
TCA TAT GAT CTG TTC ACC GCG GCG AAT GCG GAT CAT GCC ACT TCC AGC 
GGT GAC TAT GAG CH ATG ATT TGG CTT GCC CGC TAC GGC TCA GTC CAG 
CCT ATT GGC AAG CAG ATT GCC ACG GCC ACT GTG GGA GGC AAG TCC TGG 
GAG GTG TGG TAT GGT ACC AGC ACC CAG GCC GGT GCG GAG CAA AAG ACA 
TAT AGC TTC GTG GCA GGA TCT CCT ATC AAC TCG TGG AGT GGG GAC ATT 
AAG GAC TTC TTC AAC TAT CTC ACC CAG AAC CAA GGC TTC CCG GCT AGC 
TCT CAG CAT TTG ATC ACT CTG CAA TTT GGA ACT GAG CCG TTC ACC GGT 
GGC CCG GCA ACC TTC ACG GH GAC AAC TGG ACC GCT AGT GTC AAC 
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PROMOTER SEQUENCE 
SEQ ID NO; 3: 

AATTGAAGCA TTTTGATAGG TTTAAGCCTA ATCAGGATAT TGGATGAGT: GAGTT3CAGA 60 

AGHGAGGAC GGTGGGTGAA ATCGGGGGTT TGATAGGTAG GCAATGCAGG GCGGAACGGG 120 

MGGGTCTAG ACAATTTCTT TCTTTTGGAC AGCTGGTGCG TTTCACTGAG ATTAATAGTA 180 

AGCAAACTAC TCGCTCGAAG TCGTAGATGT GCATAATGGA TAACTACAGC CAACCGAAAT 2^0 

CTCCGGGCAG AAGGCCTGGA GGCAGGAGGA AACGTGGATA AGAGAGTAAT GTTTGAGTAT 300 

AGATATGTAG GCAAGAAAGG ACTGGGAGGA AGGAAGTATC GCAAACAAGA CAAGTCACTG 360 

AATAGGAAAG AATGGGGCCA TCAGAGAAAT GAATCTAAAC GGTAACTGCA GATATTACAT 420 

GGAAGAAAAT ACTATGATCC CTAATTGATA TGGTTCCATG GCCCCTGGAG ACHAAACCT 480 

CGTGGTATGA TAAACATATG AGTTACATTC TCGGTAAATC CAACAHACT CCCAAGCTCT 540 

GTTGATATTC TCCGATAATT CACCGATAAC CAACCAACCT ACTCCCGTCT AGATCCAATT 600 

GGTCTATATG CATAATGGAT ATCGTCAGCA CAGGCAGAAC CCTTTAATTT ATTTCTGGAG 660 

ATCCCGTTCT CCACAATGCT IGGHGCCGA CTGCCACAGA CCATCGCTAA CTTGAAGCGG 720 

AAAGTGCTCC GATGAAGGGT CTCATTTTGA AACGGAGGAT HACATGICA ATGHGCAGG 780 

CIGGCGHGA TGATGGCGCA ACCTGCTATA GCTAGTTGGC TTACnCGTC CTGGCTGCCG 840 

TAnGGACAC GGAAA6TCGG ACAATAATAG TGTTAACAGT AA6CGCCATT GATCAGAGTT 900 

GATGTATTTA AAGCTGCGTC GTCTGCTGCC CCCTCCGTGT TCGTGTCTTA TTCCAAACAT 960 

TCAACCTCTA TTCCTTTCGA AGTCC1TTAG ATCTGCCGTT CCTCT6CTTT ATTGCCCAAC 1020 
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INFORMATION FOR SEQ ID NO: 4: 

(I) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 17 amino acids 

(B) TYPE- amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY, linear 
(11) MOLECULE TYPE; peptide 

(V) FRAGMENT TYPE: internal 
(VI) ORIGINAL SOURCE. 

(A) ORGANISM; Aspergillus niger 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ser Trp Glu Val Trp lyr Gly Thr Sen Thr Gin Ala Gly Ala Glu Gin 
15 10 15 

Lys 

INFORMATION FOR SEQ ID NO; 5; 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 ammo acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(II) MOLECULE TYPE; peptide 
(V) FRAGMENT TYPE; internal 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
Thr Tyr Ser Phe Val Ala Gly Ser Pro He 
1 5 10 

INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 35 ammo acids 

(B) TYPE; ammo acid 

(C) STRANDEDNESS; Single 

(D) TOPOLOGY; linear 

(II) MOLECULE TYPE; peptide 
(V) FRAGMENT TYPE: internal 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
Lys Leu Val Ser Asp Val Ser Ser lie Pro Thr Ser Val Thr Xaa Ser 
1 5 10 15 

Gin AsD Aso Thr Asn Xaa Xaa Ala Ala Vai Ser Tyr Xaa 19'^ Phe Thr 
20 25 30 

Ala Ala Asn 
35 

INFORMATION FOR SEC ID NO: 7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 11 ammo acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(11) MOLECULE TYPE: peptide 

(V) FRAGMENT TYPE: internal 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Trp Thr Trp Ser Gly Gly Glu Gly Thr Val Lys 

1 5 10 

INFORMATION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 
(11) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO- 6. 
Leu Ser Ser Ser Gly Ala Ser Trp His Trr _ys 
1 5 10 
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INFORMATION FOR SEQ ID NO: 9; 

( I ) SEQUENCE' CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS. single 

(D) TOPOLOGY: linear 

(!■!) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
(XI) SEQUENCE DESCRIPTION: SEO ID NO: 9: 
GTN CCR TAC CAN ACY TCC CA 17 

INFORMATION FOR SEQ ID NO; 10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TGG ACN TGG WSN GGN GG 17 

INFORMATION FDR SEQ ID NO: U: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 

(II) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR fragment" 
(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GTGGAGTGGT GGCGAGGGAA CAGTGAAAAG CTACTCTAAC TCGGSCCTTA 


u J 1 ■ i oAcAr- 




•3AAGCTAGTC AGCGATGTGT CAAGCAHCC CACCTCGGT6 ACATGGAGCC 






:aatgtccaa gccgatgtct catatgatct gttcaccgcg gcgaatgcgg 




180 


^CCAGCGGT GACTATGAGC TTATGATTTG GTATGTGACG TCGTGAACAA 




24C 


uGAGGCTAAC gtaaccaggc ttgcccgcta cggctcagtc CAGCCTATTG 




30C 


^GCCACGGCC ACTGTGGGAG gcaagtcctg GGAGGTCTGG tacgg 




345 
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INFORMATION FOR SEQ ID NO; 12; 

( ^ ) SEOUENCE CHARACTERISTICS ; 

(A) LENGTH; 2360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPO.LOGY: linear 

(11) MOLECULE TYPE. DNA (genomic) 
(VI) ORIGINAL SOURCE; 

(A) ORGANISM; Aspergillus niger 

(B) STRAIN; 3M43 
(IX) FEATURE; 

(A) NAME/KEY; COS 

(B) LOCATION; join (1021. .1427. 1476.. 1708. 1778.. 1857) 
(D) OTHER INFORMATION: /product- "Endoglucanase" 

/gene- "eglA" 

(IX) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1021. .1427 
(ix) FEATURE: 

(A) NAME/KEY; intron 

(B) LOCATION; 1428. .1475 
(IX) FEATURE: 

(A) NAME/KEY; exon 

(B) LOCATION; 1476. .1708 
(IX) FEATURE: 

(A) NAME/KEY: Intron 

(B) LOCATION: 1709. .1777 
(IX) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1778. .1854 
(IX) FEATURE: 

(A) NAME/ KEY; sig_peDtide 

(B) LOCATION :1G21. .1068 
(IX) FEATURE: 

(A) NAME/ KEY: iiiat_oeptide 

(B) LOCATION join( 1069. .1427. 1476. .1708. :7:r..l854) 
x:) SEOUENCE DESCRIPTION; SEQ ID NO :2- 
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AATTGAAGCA TTTTGATAGG TT7AAGCCTA ATCAGGATAT TGGATGAGTC GAGTTGCAGA 60 

AGTTGAGGAC GGTGGGTGAA ATCGGGGGTT TGATAGGTAG GCAATGCAGG GCG6AACGGG 123 

AAGGGTCTAG ACAATTTC7T TCTiTTGGAC AGCT6GTGCG TTTCACTGAG ATTAATA6TA 130 

AGCAAACTAC TCGCTCGAA6 TCGTAGATGT GCATAATGGA TAACTACAGC CAACCGAAAT 7^0 

CTCCGGGCAG AAGGCCTGGA GGCAGGAGGA AACGTGGATA AGAGAGTAAT GiTTGAGTAT 3CG 

AGATATGTAG GCAAGAAAGG ACTGGGAGGA AGGAAGTATC GCAAACAAGA CAAGTCACTG 36Q 

AATAGGAAAG AATGGGGCCA TCA6AGAAAT 6AATCTAAAC GGTAACTGCA GATiTTACAT 420 

GGAAGAAAAT AC7AT6ATCC CTAATTGATA TGGTTCCATG GCCCCTGGAG AC7TAAACCT 480 

CGTGGTATGA TAAACATAT6 AGTTACATTC TCGGTAAATC CAACATTACT CCCAAGCTCT 540 

OnGATATTC TCCGATAATT CACCGATAAC CAACCAACCT ACTCCCGTCT AGATCCAATT 60C 

GGTCTATATG CATAATGGAT ATCGTCAGCA CAGGCAGAAC CCTTTAATTT ATTTCTGGAG 660 

ATCCCGTTCT CCACAATGCT TGGTTGCCGA CTGCCACAGA CCATCGCTAA CTTGAAGCGG 720 

AAAGTGCTCC GATGAAGGGT CTCATTTTGA AACGGAGGAT TTACATGTCA ATGTTGCAGG 780 

CTGGCGTTGA TGATGGCGCA ACCTGCTATA GCTAGTTGGC TTACTTC6TC CTGGCTGCCG 840 

TATTGGACAC GGAAAGTCGG ACAATAATAG TGTTAACAGT AAGCGCCAH GATCAGAGTT 900 

GATGTATTTA AAGCTGCGTC GTCTGCTGCC CCCTCCGTGT TCGIGTCHA TTCCAAACAT 960 

TCAACCTCTA TTCCTTTCGA AGTCCTTTAG ATCTGCCGH CCTCTGCTTT AnGCCCAAC 1020 

ATG AAG CTC TCC ATG ACA CTT TCC CTG TTT GCG GCC ACT GCC ATG GGC 1068 
Met Lys Leu Sen Met Thr Leu Ser Leu Phe Ala Ala Thr Ala Met Gly 
-16 -15 -10 -5 

CAG ACG ATG TGC TCI CA6 TAT 6AC AGT GCC TCG AGC CCC CCA TAC TCG 1116 
Gin Thr Met Cys Ser Gin Tyr Asp Ser Ala Ser Ser Pro Pro Tyr Ser 

15 10 15 

GTG AAC CAG AAC CTC TGG GGC GAA TAC CAG GGC ACT GGC AGC CAG TGT 1164 
Val Asn Gin Asn Leu Trp Gly Glu Tyr Gin Gly Thr Gly Ser Gin Cys 

20 25 30 

GTC TAC GTC GAC AAG CTT AGC AGC AGT 6GT GCC TCA TGG CAT ACC AAA 1212 
Val Tyr Val AsD Lys Leu Ser Ser Ser Gly Ala Ser Tro His Thr Lys 

35 40 45 

TGG ACC TGG AGT GGT GGC GA.G 6GA ACA GTG AAA AGC TAC TC" AAC 7CC 1260 
Trp Thr Tro Ser G'y Gly Glu Gly Thr Val Lys Se*" Tyr Ser Asn Ser 
50 55 60 

Gly Leu Thr Phe AsD Lys Lys Leu Val Ser Asd val Ser Se*^ He ^rc 
65 "3 75 30 
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-C: TCG GIG ACA TGG AGC CAG GAC GAC ACC AAT GTC CAA GCC GAT GTC 1356 
"^hr Ser Val Thr Trp Ser Gin Asp Asp Thr Asn Val Gin Ala Asd Val 

85 90 95 

TCA TAT GAT CTG TTC ACC GCG GCG ^AT GCG GAT CAT 3CC ACT TCC AGC 1404 
Ser Tyr Asp Leu Phe Thr Ala Ala Asn Ala Asd His Ala T'nr Ser Ser 

100 . 105 110 

GGT GAC TAT GAG CTT ATG ATT TG GTATGTGACG TCGTGAACAA. 1447 
Giy Asp Tyr 61 u Leu Met He Trp 
115 120 
GATAGATGGA GGAGGCTAAC GTA^^CCAG G CTT GCC CGC TAC GGC TCA GTC CAG 1500 

Leu Ala Arg Tyr Gly Ser Val Gin 
125 

CCT ATT GGC AAG CAG ATT GCC ACG GCC ACT 6TG GGA GGC AAG TCC TGG 1548 
Pro lie Gly Lys Gin He Ala Thr Ala Thr Val Gly Gly Lys Ser Trp 

130 135 140 

GAG GTG TGG TAT GGT ACC AGC ACC CAG GCC GGT GCG GAG CAA AAG ACA 1596 
Glu Val Trp Tyr Gly Thr Ser Thr Gin Ala Gly Ala Glu Gin Lys Thr 
145 150 155 160 

TAT AGC TTC GTG GCA GGA TCT CCT ATC AAC TCG TGG AGT GGG GAC AH 1644 
Tyr Ser Phe Val Ala Gly Ser Pro He Asn Ser Trp Ser Gly Asp He 

165 170 175 

AAG GAC TTC HC AAC TAT CTG ACC CAG AAC CAA GGC TTC CCG GCT AGC 1692 
Lys Asp Phe Phe Asn Tyr Leu Thr Gin Asn Gin Gly Phe Pro Ala Ser 

180 185 190 

TCT CAG CAT TTG ATC A GTGAGTTTTC CTAATTCTAC TAGCGAGCGC 1738 
Ser Gin His Leu He 
195 

CGGCAGHGA AATTGGTCAC TAACAGAAGT GATGATTAG CT CTG CAA TTT GGA 1791 

Thr Leu Gin Phe Gly 
200 

ACT GAG CCG TTC ACC GGT GGC CCG GCA ACC TTC ACG 6TT GAC AAC TGG 1839 
'hr Glu Pro Phe Thr Gly Gly Pro Ala Thr Phe Th<- va^ Asd Asn Jro 

205 210 215 

aCC GCT AGT GTC AAC TAA AAGGCTHAG GCGCGGCTG3 GGTAAATAAC 1887 
"nr Ala Ser Val Asn * 
220 
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TTGGAMCAC TCAHCAAGA TCGGTACTCC TCHCAGCCG A6AAAGGCAC AGA7AGTGTA 2Q07 

TCGAA7CCAA TCAAA.TCTAT nGGTGTTGC TTAAATTCCG AGCCAGTCCT TCCTTGAM 2067 

GGTAA.TCCAC CCGTAGCGAT TGATCATTAA CAGATCCGAG TGGTGCTAGG "AAA77GCT 212^ 

AACCCGATCC CGCTCCAAH AGCTAGCGCA TCCGGCAGAT TCAAACTTGA CAGTGGGCCG 2157 

GGCAiTTACCT GAACCTGTAG .AAG6AACAGA CCCHGTCTA GAAATCTCTA AATAGTATAA 22-" 



(2) INFORMATION FOR SEO ID NO; 13. 
TERMINATOR SEQUENCE 

AAGGCTTTAG GCGCGGCTGG GGTAAATAAC AGCTTGTTTC nCGTTCTAG 50 
AACGTCGGGC GTGTAAGAGC TAGAAATCCA CCCACTCTGA TTGGAAACAC 100 
TCATTCAAGA TCGGTACTCC TCHCAGCCG AGAAAGGCAC AGATAGTGTA 150 
TCGAATCCAA TCAAATCTAT TTGGTGTTGC TTAAATTCCG AGCCAGTCCT 200 
TTCCTTGAAA GGTAATCCAC CCGTAGCGAT TGATCATTAA CAGATCCGAG 250 
TGGTGCTAGG TTAAATTGCT AACCCGATCC CGCTCCAATT AGCTAGCGCA 300 
TCCGGCAGAT TCAAACTTGA CAGTGGGCCG GGCATTACCT GAACCTGTAG 350 
AAGGAACAGA CCCTTGTCTA GAAATCTCTA AATAGTATAA GCCGAAACTT 400 
GCCCCGGACG TACCCTAAGC TAAGATTGCT CTTCGCAHC CCAGGGGGGT 450 
GAACTCTCTA AAGAGGGAGC ATCGCHGCC 6AT6TCTGGT TCGGGGATCA 500 
TGA • 5037 

(2) INFORMATION FOR SEO ID NO: 14: 
SIGNAL SEQUENCE 

ATG AAG CTC TCC ATG ACA CH TCC CTG TTT GCG GCC ACT GCC ATG GGC 48 

(2) INFORMATION FOR SEO ID NO; 15: 
SIGNAL SEQUENCE 




23C7 
2360 



Met Leu Ser Met Thr Leu Ser Leu Phe Ala Ala Thr Ala Met G^y 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuic I3bis) 



A. Tbc indications made below relate to the microorganism referred to in the description 
on page _ .line 2^ OA.g( Z9 

B. IDENTIFICATION OF DEPOSIT Funher deposits arc identified on ar\ additional sheet 
Name of depositary institution 

The National Collections of Industrial and Marine Bacteria Limited (NCIMB) 
Address of deposiury institution (inchding postcl coieciU country) 

23 St. Machar Drive 
Aberdeen 
Scotland 
AB2 IRY 



United Kingdota 



Dale of deposit 


Accession Number 







C. ADDITIGNAL INDICATIONS (leave Mtnk if t%otQppiicablc) This information is oominued on an additional sheet 



In respect of those designations in which a European patent is sought, and any 
other designated state having equivalent legislation, a sample of the deposited 
microorganism will be made available until the publication of the mention of the 
grant of the European patent or until the date on which the application has been 
refused or vitlidrawn or is deemed to be withtlrav/n, only by the issue of such a 
sample to an expert nominated by the person requesting the sample. (Rule 28(4) 
EPC), 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE {if the indicsiions arc not for ail ^csigMiedSteta} 



E. SEPARATE FURNISHING OF INDICATIONS fWeb/cni if not oppiicabic) 



The indicactons iisied below will be submitted [Oihe Intemationai Bureau later {spectfyUic gentrai nature of the matcattons eg,, 'Accession 
Numoer ofDeposu'} 



For receiving Office use only — - 



This sheet was r eoeived w njhjhe international application 




For International Bureau use only 



n This sheet was received by the International Bureau on: 



Authorized officer 
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CLAIMS 

1. An enzyme obtainable from Aspergillus, wherein the enzyme has the following 
characteristics: 

a. a MW of 24,235 D ± 50 D 

b. a pi value of about 4 

c. glucanase activity 

wherein the glucanase activity is endo |3- L4-glucanase activity. 

2. An enzyme having sequence shown as SEQ. I.D. No. 1 or a variant, homologue or 
fragment thereof. 

3. An enzyme coded by the nucleotide sequence shown as SEQ. I.D. No. 2 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 

4. A nucleotide sequence coding for the enzyme according to claim 1. 

5. A nucleotide sequence coding for the enzyme according to claim 2. 

6. A nucleotide sequence having the sequence shown as SEQ. I.D. No. 2 or a variant, 
homologue or fragment thereof or a sequence complementary thereto. 

7. A nucleotide sequence according to any one of claims 4 to 6 operatively linked to 
a promoter. 



8. A nucleotide sequence according to claim 7 wherein the promoier is the promoter 
having the sequence shown as SEQ. I.D. No. 3 or a vanam, homologue or fragment 
thereof or a sequence complementary thereto. 



5 
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9. A promoter having the sequence shown as SEQ. I D. No. 3 or a variant, homologue 
or fragment thereof or a sequence complementary thereto. 

10. A promoicr according to claim 9 operatively linked to a GOI. 

11. A promoter according to claim 10 wherein the promoter is operatively linked to a 
GOL wherein the GOI comprises a nucleotide sequence according to any one of clamis 
4-6. 



10 12. A terminator having the nucleotide sequence shown as SEQ, I.D. No, 13 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 

13. A signal sequence having the nucleotide sequence shown as SEQ. I.D. No. 14 or a 
variant, homologue or fragment thereof or a sequence complementary thereto. 



15 



20 



25 



14. A construct comprising or expressmg the invention according to any one of claims 
1 to 13. 

15. A vector comprising or expressing the invention of any one of claims 1 to 14. 

16. A plasmid comprising or expressing the invcniion of any one of claims 1 to 15. 

17. A transgenic organism comprising or expressing the invention according to any one 
of claims 1 to 16. 

18. A transgenic organism according to claim 17 wherein the organism is a fungus. 



19. A transgenic organism according to claun 17 wherein the organism is a filamentous 
fungus, preferably Aspergillus. 

30 

20. A transgenic organism according to claim 17 wherein the organism is a plant. 
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2L A transgenic organism according to claim 17 wherem the organism is a yeast. 

22. A process of preparing an enzyme according to any one of claims 1 to 3 comprising 
expressing a nucleotide sequence according to any one of claims 4-8. 

23. A process according to claim 22 wherein the enzyme has the sequence shown as 
SEQ. I.D. No. 1 or a variant, homologue or fragment thereof, and the nucleotide 
sequence has the sequence shown as SEQ. I.D. No. 2 or a variant, homologue or 
fragment thereof or a sequence complementary thereto. 

24. A process according to claim 22 or claim 23 wherein the expression is controlled 
(partially or completely) by use of a promoter according lo claim 9 

25. A process for expressing a GOI by use of a promoter, wherein the promoter is the 
promoter according to claim 9. 

26. Use of an enzyme according to any one of claims 1 to 3 or prepared by a process 
according to any one of claims 22 to 25 to degrade a glucan. 

27. Plasmid NCIMB 40704, or a nucleotide sequence obtainable therefrom for 
expressing a glucanase enzyme or for controlling the expression thereof or for controlling 
the expression of another GOI. 

28. A glucanase enzyme having the ability to degrade /S-l,4-glucosidic bonds, which is 
immunologically reactive with an antibody raised against a purified glucanase enzyme 
having the sequence shown as SEQ. I.D. No. 1. 

29 A signal sequence having the sequence shown as SEQ. I.D. No. 15 or a variant, 
homologue or fragment thereof. 
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FIGURE 1 ^ 

AMY 637 PROMOTER 

SEQUENCE TYPE: Nucleotide 

MOLECULE TYPE: DNA 

ORIGINAL SOURCE: Solanum Tuberosum 

SEQUENCE LENGTH: 2094 

SEQUENCE: 

10 -20 30 dO 

ATTAAGGGGA 6CATAAGTGC AGCTCAGAAA TTCACACCTG 
50 60 73 80 

ATATTTTCCC AAAGCCCTCA AAAATGTGAA CAAATCTGCi 
90 100 110 120 

AAAATGTCAG TCAGAAGGAC TGnCTTTTA GGTTTTCTTC 

130 140 150 160 

TCTC6AGTCA CGAAATCAGA TAATATGATA A6AAATTATG 

170 180 190 200 

GAGGATTTAT AATGTATCTG TCTGTrCTTA GGTATAATTA 

210 220 230 240 

TGTGTrCCrr TATGATGTAG TAATGGAATT CTGGGCnAT 

250 260 270 280 

ATTAAAGGAA CTGAATATAA ATGTTCGCAT mTAACTGCG 

290 300 310 320 

GAGACTTCGA GTTAGAGCCT TATAATTATG tcttatcatt 

330 340 350 360 

ttatactgag atcatattac agatgatgaa agctgacah 

370 380 390 400 

GCAnAGTTA TrCTGrUTA TACAAGTCAT GTAACTGCTG 

410 420 430 440 

CTTGTGAGTT 6T6ACT6TAA GATAAATTGA TTCAGCCTTC 

450 460 470 480 

TGTGGCATTA 6CG6AGATCT GATTATACTC TCATCGTCTT 

490 500 510 520 

ATCTAAGTTG CTCATGCAAC TTTGTCCnG ATAGTTG6CT 

530 540 550 560 

AATACTACAA CTGCAAHAA GTGTAGHAT TCGAAATCTC 

570 580 590 600 

TGTTGGAAGT TGCTAAGTGC HAAGIGCTG GTTATTGTAA 

610 620 630 640 

ACCCCATCCG AGTTATTATA CAGCATCTGG CTGATGAAAT 

650 660 570 580 

GCTGCTCAH TGCAATGGTG ACATAACCAA ATGTTAGTAA 

690 700 710 720 

AACATACTA6 CTGGTT6AAT GHAGATGAT TGTTCAACGT 

730 740 750 760 

TACATCTCAC AGAAACCTTA TTATG^GAnG ACATGHAGT 

770 780 790 800 

TGATCCGAAA GATCCTTCT TTAAATGCCA AAGCTTGTA 

810 820 830 34C 

CAGATTTGAG GAGTTC^n-T AC~r:T^ GTTATATCT; 

... §5? _ _860 _ __87C ^880 
i ! iCCCAi iC Aim ijACGT 7CAGCC~CAC AGATGi iGTC 

^ ,__.J?5 '10 920 

A I AC 1 I AuAA ATGTGCG > AT ATiTATAGAG AGAGAGAGAT 
930 940 950 -60 

AGAGTGAAAT GATTATATAG "CGAAGATTA CGAAACGA 
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970 980 990 1000 

CATTGAGACA TCTGTGATTG TTTGAAA~ ATGTATATAT 

1010 1020 1030 1040 

CTGTAGCAn AGAAACTATA AGAGTTGnA GCTTCACnC 

1050 1060 1070 1080 

TCTTAT^Gn GTGCTCAAAG CAACrrCATC ATACAGTATG 

1090 1100 iilO 1120 

GimTATAT GCTCrrCCAT TATCACCGAA CCTTATGATT 

1130 1140 1150 ilcO 

ATGTGTACGA GC77ATAATA TTACTGATGG TGATTZAGT- 

1170 1180 1190 1200 

7TATGATTAT GTCCTCCAH AATTATTCT3 TTCATACAA 

1210 1220 1230 1240 

GTCGTGTAAT TTGCTGTTTG TGATTGTACG ATAAATTGAT 

1250 1260 1270 1280 

TCAACC77CT GCGGTGTTGG TTGAAGTrCA AGTAAA7TAG 

1290 1300 1310 1320 

CTTTATTTA7 CATAGTAGCA TTTGAnATT GATGCTCTGT 

1330 1340 1350 1360 

A6CTAATGAT AAGCCATTGA AGGGAA6CAG AAATGGTAAA 

1370 1380 1390 1400 

GCTTTCTAAA AT6AATCTAC GAATGGATGA TAAAGTTAAT 

1410 1420 1430 1440 

GAATATTGTT GATACTTCTG CAATCAGATT ATGAGHACT 

1450 1460 1470 1480 

6AGTCTACTG TTnTTAAGC CTGTTTCAGA TGATCGATCA 

1490 1500 1510 1520 

TCAACAACAA CATAITCAGT GTA6TA6ACA TGATCGATCA 

1530 1540 1550 1560 

CTTTCTAAn TTCGATTATG CACCCTCTTT TCTCCAATTT 

1570 1580 1590 1600 

GGTC6TCTTC lliilllCAT GAT6TCACTG AAnATTCTC 

1610 1620 1630 1640 

TGGTCGTCCC CACCAHCAG GAAGTCACTT CGAGCATAAT 

1650 1660 1670 1680 

GTGAAAACAT CCACATTTTT CAAATCCAGC AGAATTTTCA 

1690 1700 1710 1720 

TCAAACGGG: TTCAACAm ACTACATGTA TACACTCTGA 

1730 1740 1750 1760 

AGTCTGAATC CACTAAHCT AGATGGTGCA TCTGTGCCCC 

1770 1780 1790 1800 

CACACTTGTG AAAGCT7ATT CTCAATTTTT TATTTTCCAA. 

1810 1820 1830 1840 

CAACTTGAAT TCAGACCACA CAACTCCCGT GTCTTGTACG 

1850 1860 1870 1380 

GTCAGCATCT GAGTGGAGAA CTCAAITAAG TGACTTTAAC 

1890 1900 1910 1920 

GTCGAG^CT ATAGTAAACA ACCCCTATAT CTTTTTTCAA 

1930 1940 1950 -^O 

GCATG. .AA6 ATTGCGAACA CACTGAAATT 'CCAGG'CG' 

1970 1980 1990 2000 

TAATC^GTA CCCAGTGTGT GTACTTTTJA AAAAAAAAG" 

2010 2020 2030 2040 

CAGTi GT:t:TAAAA CACATTTAAA TAGAGT7TAT 

_ 205C 2060 2070 2080 

■■GCCATc,, ttgttc:tca tactagact' cggag'caa: 

2290 
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':3Lr£ : ::fr:NUE: 
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FIGURE 2 

AMY 351 PROMOTER 
SEOUENCc TYPE; Nucleotide 
MOLECULE TYPE: DNA (genomic) 
ORIGINAL SOURCE Solanm tuDerosum 
SEQUENCE LEM6TH: 1734 bp 
STRANDEDNESS : DouDle 
TOPOLOGY Linear 
SEQUENCE: 

10 20 30 40 

TCnTAAGTT G'HTGCTTGA TTTTTCTTC" TCAATCTTCT 
50 60 70 80 

ATATTTAATT C6TTTTAGCT TCAAACTTCT TCAATTTTAT 
90 100 110 120 

TTCAATTTAA TTCTACAAAA AAAATCTCTA TTTAGCACCA 

130 140 150 160 

TTCATAAAAT TCATGCTCAA AATGGGCAAA CATAAATAAT 

170 180 190 200 

AAATGTGAAG TAAATAATGG AHAAAATAT ATATTTTTGG 

210 220 230 240 

GCCTCACATC AACCTTCATA ATTCTTGAAT GAATGAATGA 

250 260 270 280 

TAGAC1TCAT AATTmTAA CCTATACATA TAAGAAAATT 

290 300 310 320 

GA6AGTAACT CAAATAACAA 6TTGTA6TAT CACATCTHA 

330 340 350 360 

CTATTTGATA ACAHATGAA GGTGATTATA CAHACGTAA 

370 380 390 400 

CATTTCTTTT AAAAATATGT AAGCAAATTT ACTTnTAAC 

410 420 430 440 

TTATCAHGA TCTTCATG6T nTGTCATAA ATCTCAAA6T 

450 460 470 480 

TATCATATTT TATATAGCTA TTTGAAAGTA ATTTTATTTT 

490 500 510 520 

TACTCATCAT T6AGT6ATGC TTTTATTATA ATACTAGTAA 

530 540 550 560 

GTTTTATTTA TTATTTTCn TTAGGGGTGA ATTGTATAAT 

570 580 590 600 

ATAATAAAAA ATATATTTTT AGAAATAATG AnCTTTTAT 

610 620 630 640 

TATTAAAAAG TTAAGATATT AGATTATTTA TGCTTGTATA 

650 660 670 680 

ATAATGAACG AAGTTTTATT TTCTATGAGT TTCATTAATC 

690 700 710 720 

ATGTTTGTAA TTATiTCAAA TTiTGATGTA TTTTTATAAT 

730 740 750 760 

TTTGTATTAT TATATTAHA TACTATATTT AAAAATTTAA 

770 780 790 800 

AGATCCATAG GGCTTACGCC CCACGTCAAG AGGCTTGCGC 

810 820 830 840 

CTTTCCCTAA AHAAGTAAA ACTCTTCGCC TCATGCCTTA 

B50 860 870 880 

C3CCTCCGCC rTTAAAACA CTGATTCCTT TCCTCATATA 
_ 890 900 910 920 

GCi;GAGGCG AAAATATTTA ATAAAAACAC "CTAATTT 

930 940 950 960 

G. . lATATGT TCAA.i iGAAC ATGTCCGTGA ~AGAAAATT 
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FIGURE 2 CONTINUED ^ 

970 980 990 iOOO 

AAATTAAAn CAATGACAAA TTAATAATT TGACACAAAA 

1010 1020 1030 1040 

"TTATGAAAA AAATATCAAA ATATAAAGAA ATATTT7TTT 

1050 1060 1070 1080 

TGAAATGGAT TAAAAA6AAA AAAAAAACAA ATAAATTGAA. 

1090 1100 1110 1120 

CCGGGATAAG TTGGTTGTTT AATTGAfTAT TGAHATGAT 

1130 1140 1150 1160 

CTCAATTTGA CATTTTGCGC GATCTT7CGA CCTCAATTC6 

1170 1180 1190 1200 

TATGAACTGA CACTACGCCA ATGGACAGTC GCCGTCGTCA 

1210 1220 1230 1240 

CCGCCACCGC ACTATTCTCG ACGCGTCGTC TATCTCCTCC 

1250 1260 1270 1280 

ACCCCACAGC CGTCAATTCC AAGCTTCCAA TGAACCGTTG 

1290 1300 1310 1320 

CCATGTGTCA CTGCCTATTC ACCGCGAAAC ATGAATATCA 

1330 1340 1350 1360 

CTGACGAACG ATTTCGGAGC GGAAC6AATC CAGAAAATGG 

1370 1380 1390 1400 

AnACTTTCT ATAAAHCCT CGAATCTCAA CTCCATTTCG 

1410 1420 1430 1440 

TAAAAATAAA AHAAAAATA TTGmCTTT TTGTATTTCT 

1450 1460 1470 1480 

TTTTGTATTT CTGGTTTATG TGGTGATCGA ATTTTCAATT 

1490 1500 1510 1520 

rmTACTGG TA6TGATTCC TACTTTTCTT CAAHGCATT 

1530 1540 1550 1560 

TCTCCTTTTT CCATTTCACG GTT6AGAATT CATGATTCCT 

1570 1580 1590 1600 

TATCAGAGGA ATCGATCCGA TTTGACTAAT TTCACTTTTC 

1610 1620 1630 1640 

GTCTGTATAA ATACCAGAGT ATCTAGGTTG AGGAACGTAA 

1650 1660 1670 1680 

TTTCAAGCTG CGATCGGCTT TTTCCCCTGA ACGAGCAAAC 

1690 1700 1710 1720 

ACAGGnCTG GGTTCGAGn A6CAAGGGAC GTATAATCTC 

1730 
AACTACAATC CATT 
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FIGURE 3 5/31 

a-AMYLASE CODING SEQUENCE 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2017 base Ddirs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS douDle 

(D) TOPOLOGY linear 

(A) LENGTH: ^75 amino ac^ds 

(B) TYPE: amino ac'd 

iZ) STRANDEDNESS: sinai-= 
(D) TOPOLOGY: linear 

AT6 AAG TCT CTC GCC GCA AH GCT GCT CTG CTG ~CG CCC ACA CTG GTC 48 
Met Lys Ser Leu Ala Ala lie Ala Ala Leu Leu Ser Pro Thr Leu 7a 1 
-18 -15 -10 -5 

CGG GCA GCG ACT CCG GAT GAG TGG /vAA GCT CAG TCG ATC TAT TTC ATG 96 
Arg Ala Ala Thr Pro Asp 61u Trp Lys Ala Gin Ser He Tyr Phe Met 
1 5 10 

CTG ACG GAC CGG TTT GCG CGT ACC GAC AAT TCG ACC ACG GCT CCC TGT 144 
Leu Thr Asp Arg Phe Ala Arg Thr Asp Asn Ser Thr Thr Ala Pro Cvs 
15 20 26 30 

GAC ACC ACT GCC GGG GTATGCAACT AACCCTGTGT TTCTCTTCCC GGGACGTACA 199 
Asp Thr Thr Ala Gly 
35 

AGGGGTCTTC TCCATGCTAA CCGTGCACAT GCAG AAA TAT TGC GGG GGA ACA 251 

Lys Tyr Cys Gly Gly Thr 
40 

TGG CGA GGT ATC ATC AAC AAC GTAAGTGGCT TCTGATTTTC GCTCAATAAT 302 
Trp Arg Gly He He Asn Asn 
45 

CnCGTCGCG TGACTTTATT TCCTAG CTG GAT TAC ATC CAG GAT ATG GGC TTC 355 

Leu Asp Tvr lie Gin Asp Met Gly Phe 
50 ' 55 

ACA GCT ATC TGG ATA ACT CCA GTG ACA GCC CAG TGG GAC GAC GAT GTG 403 
uThr Ala lie irp lie ihr Pro Val Thr Ala Gin Tro Asd Asd Asd Val 
60 65 70 

GAT GCG GCA GAT GCA ACG TCG TAT CAC GGT TAT TGG CAG AAA GAC C ^50 
Asp Ala Ala AsD Ala Tnr Ser Tyr His Gly Tvr irs Gin Lvs Aso ^eu 
75 80 " ' 85 ' 



6 1 GCGCAACC CTGCTCCATG GA . CGCTGGC : GCAAACTCG T3CT3AT^.GG 1 3A . 



rGAAACAG A TAC TC" CTG AAT TCG AAA FTC GGC -C~ GCC 560 
Tvr Se"" Leu Asn Ser Lvs Phe G"iv Tnr A'la 
90 95 
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GAT GAC T-G AAA GCC CTG GCT GAT GCC CTT CAC GCC CGT GGG AT3 CTT 608 

AsD Asp Leu Lys Ala Leu Ala Asd Ala Leu His Ala Arg Gly Met Leu 

100 105 110 115 

CTC ATG G"C GAC GTC GTG GCT AAT :aC TTT GTACGGACCA TCTACATiCC c58 
L9U Met val Asp Va 1 val Ala Asn H'3 ?he 
120 125 

TGGGAAACGC 3.AAGAAGGAA AAAAAAAAA;\ AGGCGCACGC TAACATTTCG CG'TTAG 715 

GGC tag GGC GGT TCT cat AGC gag GTG GAT TAG TCG AK TTC AAT CC ^53 
Gly Tyr Gly Giv Ser His Ser Glu Val Asd Tvr Ser He Phe Asn P^o 
130 135 ■ UO 

CTG AAC AGC GAG GAT TAG TTC CAC CCS TTC TGT CTC ATT GAG GAC TAG 311 
Leu Asn Ser Gin Asp Tyr Phe His Pro Phe Cys Leu He Glu Asp Tyr 
145 150 155 

GAC AAC GAG GAA GAA GTC GAA CAA TGC TGG CTG GCC GAT ACT CCG ACG 359 
Asp Asn Gin Glu Glu Val Glu Gin Cvs Trp Leu Ala Asd Thr Pro Thr 
160 165 ' 170 

ACA HG CCC GAC GTG GAC ACC ACC AAT CCT CAG GTT CGG ACG TIT TTC 907 
Thr Leu Pro Asp Val Asp Thr Thr Asn Pro Gin Val Arg Thr Phe Phe 
175 180 185 

AAC GAC TGG ATC AA6 AGC CTG GTG GCG AAC TAG TCC A GTATGATTGT 954 
Asn Asp Trp He Lys Ser Leu Val Ala Asn Tyr Ser 
190 195 200 

TCCCGCGGTA ACGCTTTAGG GCTTGCTCTA ACTGAAATC6 ACAG TC GAT GGT CTG 1009 

He Asp Gly Leu 
205 



CGC GTC GAC ACC GTT AAG CAC GTG GAG AAA GAT TTC TGG CCC GAC TTC 1057 
Arg Val Asp Thr Val Lys His Val Glu Lys Asp Phe irp Pro Asp Phe 
210 215 220 



AAC GAA. GCT GCT GCG TGT ACC GTC GGC GAG GTG TTC AAC GGT GAC CCA IIDS 
Asn Glu Ala Ala Ala Cys Thr Val 31 v Glu Val Phe Asn Gly Asd Pro 
225 230 235 

GCG TAC AC: TGC CCA TAC CAG GAA GTG CTG GAT GGC GTT CTG AAC TAT 1153 
Ala Tvr Thr Cys Pro Tyr Gin Glu val Leu AsD Glv -Jal Leu Asn ivr 
2^0 2^5 ' 250 

CCG AT 3TGAGTGATT CCGAAAGTTC CATCGATCAG GCTTTCTGAC GCATGAGAAC 1208 
Pro He 
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AGC TAC TAT CCT GCG CTJ GAT GCA TTC AAG TCT GTC GGC GGC ^AT CTC 1256 
Tyr ~vr Pro Ala Leu Asd Aid Phe Ivs Ser Vdl Glv Giv Asn Lej 
260 '265 ' ' 27G 

GGC GGC TTG GCT CAG GCC ATC ACC ACC GTG CAG GAG AGC T3C AAG GAT :20<^ 
G I / Gly Leu A'a Gin Ala He Thr Thr Val Gin Gu Ser Cvs Lvs Asc 
275 230 " 285 

TCC AAT C'G CTC GGC AAT TTC Cr GAG AAT CAC GAC ATT GCT CGC TT^ 1352 
Ser Asn ^_9u ueu G.v Asn Phe Leu Glu Asn His Asp ile Ala Acq Phe 
290 ' 295 300 

GCT TC GTATGGACAC TCTTTTTGAA GCCCTCATC3 ATTGGGGATG CT3ACACGGA :iQ7 
Ala Ser 

CAACAACAAC AG G TAC ACG GAT GAC CTJ GCT CTC GCC AAG AAT GGT CTC 1456 
Tyr Thr Asp Asp Leu Ala Leu Ala Lys Asn Gly Leu 
305 310 315 

GCT nC ATC ATC CTC TCG GAT GGT AH CCG ATC ATC TAC ACG GGC CAG 1504 
A1a Phe He He Leu Ser Asp Gly lie Pro He lie Tyr Thr Gly Gin 
320 325 330 

GAG CAG CAC TAC GCC GGT GAT CAC GAT CCC ACA AAT CGT GAG GCC GTC 1552 
Glu Gin His Tyr Ala Gly AsD His Asp Pro Thr Asn Arg Glu Ala Val 
335 340 345 

TGG CTG TCT GGC TAC AAT ACC GAC GCC GAG CT6 TAC CAG TTC ATC AAG 1600 
Trp Leu Ser Gly Tyr Asn Thr Asp Ala Glu Leu Tvr Gin Phe lie Lvs 
350 355 360 

AAG GCC AAT GGC ATC CGC AAC HG GCT ATC AGC CAG AAC CCG GAA TTC 1548 
Lys Ala Asn Gly He Arg Asn Leu Ala He Ser Gin Asn Pro Glu Phe 
365 370 375 380 

ACC TCC TCC AAG GTGAGTACAA TAACAAACTT TTC6AAAAAT T7TTCACCGG 1700 
Thr Ser Ser Lys 



AGAAAACCTA AGATTCGGCT AACAAAACAA AAAAAAAAAA G ACC AAG GTC ATC 1-53 

Thr Lys Val He 

385 

• CAA GAC GA''" fCG ACC C^ i GCC ATT AAC C3G GGC GGC GTC G"^ AC< .301 
^vr G'C Aso AsD Ser Thr lS'j Aia He Asn Arg Glv G^v Va- Va^. """^r 
390 395 400 

3TC CiG AGC AAT GAA GGC GCC »CC GGl3 GAG ACC GGG ACi GTC ""CC *3^5 
vai Leu 3e^ Asn Glu Glv Ala Ser Glv Glu Thr Glv ^hr Val Ser ye 
405 410 415 ^ 42C 
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CC5 GGA ACT GGC TTC GAG GCC GGC ACG GAA TTG ACT GAT GTC ATC TCC 189^ 
Pro Gly Thr Gly Phe Glu Ala Glv Thr Glu Leu Thr Asd Val He Ser 
425 430 435 

T3C AAG ACC GTG ACT GCG GGG GAC AGC GGG GCG GTC GAC GTG CCC *T3 1945 
Cvs Lvs !hr Val ihr Ala G.v AsD Ser Glv A'.a Val AsD Val Pre Leu 
440 ' 445 ' 450 

TCG GGC GGA CTG CCA AGC GTG CTC TAT CCC AGC TCC CAG CTG GCC AAG 1993 
Ser Gly Glv Leu Pro Ser Val Leu Tyr Pro Ser Ser Gin Leu Ala lvs 
455 460 465 

AGT GGT CTG TGT GCG TC3 GCG TGA 20 1 7 

Ser Gly Leu Cys Ala Ser Ala 
470 475 
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a-AMYLASE CODING SEQUENCE 

SEQUENCE TYPE: Nucleotide 

MOLECULE TYPE ONA 

ORIGINAL SOURCE Soianum Tuberosum 

SEQUENCE LENGTH: 1570 

SEQUENCE 

iO 20. 30 'iO 

TGTGGTGATC GAy^TTTTCAA . : 1 1 , i .ACT GAGTATCTAG 
50 60 ^0 80 

GHGAGGAAC GTAATTTCAA GCTGCGATC3 GCT^TTTCCC 
90 100 110 120 

CTGAACGAGC AAACACAGGT TGTGGGTTCG AGTTAGCAAG 

130 140 150 160 

GGACGTATAA TCTCAACTAC AATCCAHAT GGCGCTTGAT 

170 180 190 200 

GAAAGTCAGC AGTCTGATCC AnGGHGTG ATACGCAATG 

210 220 230 240 

GAAAGGAGAT CATATTGCAG 6CATTCGACT GGGAATCTCA 

250 260 270 280 

TAAACATGAT TGGTGGCTAA ATTTAGATAC GAAACHCCT 

290 300 310 320 

GATAnGCA;^ A6TCTGGTTT CACAACTGCT TGGCTGCCTC 

330 340 350 360 

CGGTGTGTCA 6TCATTGGCT CCTGAAGGH ACCHCCACA 

370 380 390 400 

GAACCTTTAT TCTCTCAAH CTAAATATGG TTCTGAGGAT 

410 420 430 440 

CTCHAAAAG CTTTACTTAA TAAGATGAAG CAGTACAAAG 

450 460 470 480 

TTAGAGCGAT G6CGGACATA GTCATTAACC ACCGTGTTGG 

490 500 510 520 

GACTACTCAA G6GCATGGTG GAATGTACAA CCGCTATGAT 

530 540 550 560 

GGAATTCCTA TGTCTTGGGA TGAACATGCT AHACATCH 

570 580 590 600 

GCACTG6TGG AAGGGGTAAC AAAAGCACTG GAGACAACTT 

610 620 630 640 

TAATGGAGTT CCAAATATAG ATCATACACA ATCCTTTGTT 

650 660 670 680 

CGGAAAGATC TCATTGACTG GATGCGGTGG CTAAGATCCT 

690 700 710 720 

CTGTTGGCTT CCAAGATTTT CGTTTTGATT TTGCCAAAGG 

"30 740 750 760 

TTATGCTTCA C^AG1A1GJ^A AGGAATATAT CGAGGGAGCT 

'70 780 790 800 

GAGCCAATiT TTGCAGTTGG AGAATACTGG GACACTTGCA 

810 820 830 840 

ATTACAAGGG CAGCAA.TTTG GAHACAACC AAGATAGTCA 

850 860 870 880 



wAuuCAAAGA ATCATCAt'TT GGA 1 1 GATGG CGCGGGACAA 



890 900 
CTTTCAA.C"3 CATTCGATTT T, 




910 920 
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930 940 950 960 

AGGAAGCAGT CAAAGGAGAA nCTGGCGTT TGC3TGACTC 
970 980 990 1000 

TAAGGGGAAG CCCCCA6GAG TTTTAGGAIT GTGGCCTTCA 

1010 1020 1030 1O40 

AGGGCTGTCA CTTTTAnGA TAATCACGAC ACT3GATCAA 

1050 1060 1070 1O80 

CTCA6GCGCA TTGGCCTTTC CCITCACGTC ATGTTATGGA 

1090 1100 HID 1120 

6GGCTATGCA TACAnCHA CACACCCAGG GATACCATCA 

1130 1140 1150 1160 

GUTTCTTTG ACCATTTCTA CGAATGGGAT AATTCCATGC 

1170 1180 1190 12O0 

ATGACCAAAT TGTAAAGCTG AnGCTATC GGAGGAATCA 

1210 1220 1230 1240 

AGGCATACAC AGCCGTTCAT CTATAA6AAT TCHGAGGCA 

1250 1260 1270 1280 

CAfiCCAAACT TATACGCT6C AACCAHGAT GAAAAGGHA 

1290 1300 1310 1320 

GCGTGAAGAT TGGGGACGGA TCATGGAGCC CTGCTGGGAA 

1330 1340 1350 1360 

AGA6TGGACT CTCGCGACCA GTG6CCATCG CTATGCA6TC 

1370 1380 1390 1400 

TGGCA6AAGT AATCTTACA6 CTATTCCGn ACHAATATA 

1410 1420 1430 1440 

TTAGTAGAAA TATATATGH HAAACCCGA GCACCTACTT 

1450 1460 1470 1480 

CTAACACTAG ATCCGCCTCT ACAGGCTTGG ATGGAGTGAT 

1490 1500 1510 1520 

GAGIIIMI! nCCTGTTCA HAGACATTG CAACATGGGA 

1530 1540 1550 1560 

TGTATGTTTT GHAATAAAA GTGTTCnGA TCAATGCAAT 

1570 
GTAATAAGGG 
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FIGURE 5 

SEQUENCE. Nucleotide sequence of a cONA encoding tne large suDurit c~ ADP- 

glucose pyroDhosDnorvlase from Derley seed encosperm (beoUOl 
SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE 'YPE; ONA 
ORIGINAL SOURCE BARLEY 
SEQUENCE LENGTH: 2037 
STRANDEDNESS: DOUBLE 
TOPOLOGY : LINEAR 



1 ACGACCACCT CCGAACTCAA CGCCTCCACG GACCATCTCT 
41 CTCCTCTCCC CTCCCCTCAC CACCACCACC ACCACCACCC 
81 CTTCTCCCTC CCTGCATTTG ATTCGTTCAT ATTCATCCGT 
121 CGCTTGCCCG GTCGCCACCC CGTCGATCCC TCACCCCGCC 
161 GTCCCCGGCA GHGCAGGTG GACTGCTAAT GTCATCGATG 
201 CAGTTCAGCA GCGTGCT6CC CCTG6AGGGC AAGGCGTGCG 
241 TTTCCCCAGT CAGGAGAGAG GGATCGGCCT GCGAGCGCCT 
281 CAA6ATCGGG 6ACAGCAGCA GCATCAGGCA CGAGAGAGCG 
321 TCCAGGAGGA TGTGCAACGG C6GCGCAGGG GCCCCGCCGC 
361 CACCGGTGCG CAGTGCGTGC TCACCTCCGA CGCCAGCCC3 
401 GCCGACACCC HGnCTCCG GACGTCCHC CGGAGGAATT 
ACGCCGATCC GAACGAG6TC GC6GCCGTCG GTCGCGGCCG 
TCATACTCGG CGGCGGCACC GGGACTCAGC TCTTCCCGCT 
CACAAGCACA AGGGCCACAC CTGCTGTTCC TATTGGAGGA 
TGHACAGGC TCATCGATAT TCCCATGAGC AACTGCTTCA 
601 ACAGTGGCAT CAACAAGATA 7TC6TCATGA CCCA6TTCAA 
CTCGGCATCT CTCAATCGCC ACAHCACCG CACCTACCTC 
GGCGGGGGAA TCAATHCAC TGATGGATCT GHGAGGTAT 
TGGCCGCGAC ACAAATGCCT G6GGA6GCTG CTGGATGGTT 
CCGCGGAACA GCGGATGCCG TCAGAAAATT TATCTGGGTG 
801 CTTGAGGACT ACTATAAGCA TAAATCCATA GAGCACATTT 
TGATCTTGTC GGGCGATCAG CTTTATCGCA TGGATTACAT 
GGAGCTTGTG CAGAAACATG TG6ATGACAA TGCTGACATT 
ACTTTATCAT GTGCCCCTGT TGGAGAGAGC CGGGCATCTG 
AGTACGGGCT AGTGAAGHC GACAGTTCAG GCCGTGTGAT 
1001 CCAGTTTrCT GAGAAGCCAA AGGGCGACGA TCTGGAAGCG 
ATGAAAGTGG ATACCAGTTT TCTCAATTTC GCCATAGACG 
ACCCTGCTAA ATATCCATAC AHGCTTCGA TGGGAGTTTA 
TGTCTTCAAG AGAGATGTTC TGCTGAACCT TCTAAAGTCA 
AGATACGCAG AACTACATGA CTTTGGGTCT GAAATCCTCC 
1201 CGAGAGCTCT GCATGATCAC AAT6TACAG6 CATATGTCTT 
CACTGACTAC TGGGAGGACA TTGGAACAAT CAGATCCTTC 
TTCGATGCGA ACATGGCCCT CTGC3AACAG CCTCCAAAGT 
TTGAATTTTA TGATCCAAAA ACCCCCTTCT TCACTTCGCC 
TCGGTACTTA CCGCCAACAA AGTCAGACAA GTGCAGGATC 

ug: aaagaagcga tcatttcgca CGGCTGCTTC HGCGTGAA' 
gcaaaatcga gcactccatc atcggcgttc ghcacgcct 
aaactccgga agcgagctca agaacgcgat gatgatgggc 

GCGGACTCGT ACGAGACCGA GGACGAGATC TCGAGGCTGA 
tgtctgaggg CAAGGTTCCC ATCGGCGTCG GGGAGAACAC 
1601 aaagatcag: AACTGCATCA tcgacatgaa CGCGAGGATA 
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GGAAGGGACG TGGTCATCTC AAACAAGGAG GGG6TGCAAG 
MGCCGACAG GCCGGAGGA;\ GGGTACTACA TCAGGTCCGG 
GATCGTGGTG ATCCAGAAGA ACGCGACCAT CAAGGACGGC 
ACCGTCGTGT AGGGCGTGCC GGGTCGGCGC GACGGGGHC 

1801 tgcgacaacc tgtgcgctgc gtcggtcgtc atcatcttct 
caaactccgg gactgaagaa gtgatccggg gacgggagac 

GTTTGAAGCT TGAA.TGACTG AGAC7G.AAAG TGAAGGCGCA 

gcagaggcag gcagcattag tagtaagtag taagt.aagta 

6CAGTG6AAC AAAGTAATAG TCGTTCGTTT TTCCCCTGTA 
2001 ATAAATAAGA GGCTGTGTGT TGAGGTA^^AA AAAAAAA 
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SEQUENCE: Nucleotide sequence of a cDNA encoding the smal' suDumt of AOP- 

glucose pyroDhosonorvlase from barley seed enaosoerm ;beps) 
SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE TYPE; DNA 
ORIGINAL SOURCE: SARLEY 
SEQUENCE LENGTH: 1822 
STRANDEDNESS : DOUBLE 
TOPOLOGY: LINEAR 

COMMENT. The "." a: 1569 cenotes a purine. 



AAAAGTGAAC TCACACATCA CTCAATATCT ATATCCHCC 
ATTTTATATC CCTCGGTGAT GGATGTACCT TTGGCATCTA 
AAGHCCCTT GCCCTCCCCT TCCAAGCATG AACAATGCAA 
CCrrTATAGT CATAAGAGCT CATCGAAGCA TGCAGATCTC 
AATCCCCATG CTATTGATAG TGHCTCGGT ATCATTCHG 

201 6AGGTGGTGC AGGGACTAGA TTGTATCCCC TGAC6AAGAA 
GCGTGCAAAG CCTGCAGTGC CATTGGGTGC CAACTACAGG 
CTTATTGATA TTCCTGTCAG TAATTGTCTG AACAGCAACA 
TATCAAAGAT CTATGTGCTT ACACAGTTCA ACTCAGCHC 
TCTTAATCGT CATCTCTCAC GA6CCTATGG GAGCAACAH 

401 GGAGGHACA AGAATGAAGG ATTTGTTGAA GTCCHGCTG 
CACAGCAGAG CCCAGATAAC CCTGACTGGT TCCAGGGTAC 
TGCAGATGCT GTAAGGCAGT ACTTGTGGCT ATTCGAGGAG 
CATAATGHA TGGAGTATCT AAHCTTGCT G6AGATCACC 
TGTACCGAAT GGACTATGAA AAGTTTAnC AGGCACACAG 

601 AGAAACGGAT GCTGATATTA CTGTTGCTGC CTTGCCCATG 
GATGAGGAAC GTGCAACTGC ATTTGGCCTT ATGAAAATCG 
ATGAA6AAGG GAGGATAATT GAATTCGCAG AGAAACCAAA 
AGGAGAACAG TTGAAAGCTA TGATGGTTGA TACGACCATA 
CTTG6CCTTG AAGATGCGAG GGCAAAGGAA ATGCCHATA 

801 TTGCTAGCAT GG6TATCTAT GTTATTAGCA AACATGTGAT 
GCTTCAGCTT CTCCGTGAGC AATTTCCTGG AGCTAATGAC 
TTCGGAAGTG AAGHATCCC TGGTGCAACT AGCACTGGCA 
T6AGGGTACA AGCATACCTA TACGACGGTT ACTGGGAAGA 
TATTGGTACA AHGAGGCAT TCTATAATGC AAATTTGGGA 

iOOl ATTACCAAAA AACCAATACC T6ATTTCAGT TTCTATGACC 
GTTCTGCTCC CATTTACACA CAACCTCGAC ACTTGCCTCC 
TTCAAAGGTT CTTGATGCTG ATGTGACAGA CAGTGTAA.TT 
GGTGAA6GAT OTGHATTAA AAACTGCAAG ATACACCAH 
CAGTAGTTGG ACTCCGTTCC TGCATATCTG AAGGTGCAAT 

1201 AATAGAGGAC ACGTT6CTAA TGGGTGCG6A CTACTATGAG 
ACTGAAGCTG ATAAGAAACT CCTT3CTGAA AAAGGTGGCA 
TTCCCATTGG TATTGGAAAG AATTCACACA TCAAAAGAGC 
AATCAITGAC AAGAATGCTC GTATTGGAGA TAACGTGAT3 
ATAATCAATG TTGACAATGT TCAAGAAGCG GCGAGGGAGA 

1401 CAGATGGATA CTTCATCAAA AGTGGCATCG TAACTGTGAT 
CAAGGATGCT TTACTCCCTA GTGGAACAGT CATATGAAGC 
AGATGTGAAA tgTATGCCAA AAGACAGGGC TACTTGCGTC 
AGTCTGGAAT CAACCAACAA GGCCGCGAAG GAGATCATAA 
AATAAAAA..G GAGTGCCATG CGA3TCACTT CTACACCCr 

1601 TTCCCCCCTT GATGTAHAG GAACTGTGAT GTACAAGCAA 
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CTGTGATGCA CTTACGCGAA GTGCCCCTGG ATTCAGCTTT 

CTCTTTGCn GTAACTGGn TCCA6CAGAC CATGCTATTT 

GTTGTAT6GT TCGTGCAAAA CCTTGCGATG CTTTATATAT 

GCTTTATATA TAAACAAGAT GAA.TCCCCGC GCGTTGCTGC 
2001 GGCACAAAAA AAAAAAAAAA AA 
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Q-GLUCAN LYASE COOING SEQUENCE 

SEQUENCE TYPE: NUCLEIC ACID 

MOLECULE ~YPE. ONA (GENOMIC) 

ORIGINAL SOURCE: FUNGALLY INFECTED ALGAE 

SEQUENCE LENGTH: 3267 3P 

STRANDEDNESS DOUBLE 

SEQUENCE: 

:0 • 20 30 40 50 50 

1(11 

1 ATGTT7TCAA CCCTTGCGT" TGTCGCACCT A6T6CGCTGG GAGCCAGTAC CCG^^GGG 
61 GCGGAGGTCA GGTCAAATGT TCGTATCCAT TCCGCTTTTC CAGCTGTGCA CACAGCTACT 
121 CGCAAAACCA ATCGCCTCAA TGTATCCATG ACCGCATTGT CCGACAAACA AACGGC7ACT 
181 GCGGGTAGTA CAGACAATCC GGACGGTATC GACTACAAGA CCTACGATTA CGTCGGAGTA 
2^1 'GGGGmCA GCCCCCTCTC CAACACGAAC TGGTTTGCTG CCGGCTCr: TACCCCGGGT 
301 GGCATCACTG ATTGGACGGC TACAATGAAT GTCAACTTCG ACCGTATCGA CAATCCGTCC 
361 ATCACTGTCC AGCATCCCGT TCAGGTTCAG GTCACGTCAT ACAACAACAA CAGCTACAGG 
421 GTTCGCTTCA ACCCTGATGG CCCTAnCGT GATGTGACTC GTGGGCCTAT CCTCAAGCAG 
481 CAACTAGATT GGATTCGAAC GCAGGAGCTG TCAGAGGGAT GTGATCCCGG -iATGACTTTC 
541 ACATCAGAAG GTTTCTTGAC TT7TGAGACC AAGGATCTAA GCGTCATCA^ CTACGGAAAT 
601 TTCAAGACCA GAGTTACGAG AAAGTCTGAC GGCAAGGTCA TCATGGAAAA TGATGAAGTT 
661 GGAACTGCAT CGTCCGGGAA CAAGTGCCGG GGATTGATGT TCGTTGATAG AHATACGGT 
721 AACGCTATCG CTTCCGTCAA CAAGAACTTC CGCAACGACG CGGTCAAGCA GGAGGGATTC 
781 TATGGTGCAG GTGAAGTCAA CTGTAAGTAC CAGGACACCT ACATCTTAGA ACGCACTGGA 
841 ATCGCCATGA CAAATTACAA CTACGATAAC TT6AACTATA ACCAGTG6GA CCHAGACCT 
901 CCGCATCATG ATGGTGCCCT CAACCCAGAC TAmTATTC CAATGTACTA CGCAGCACCT 
961 IGGHGATCG HAATGGATG CGCCGGTACT TCGGAGCAGT ACTCGTATGG ATGGTTCATG 
1021 GACAATGTCT CTCAATCTTA CAT6AATACT G6AGATACTA CCTGGAATTC TGGACAA6AG 
1081 GACCTGGCAT ACATGGGCGC GCAGTATGGA CCATTTGACC AACATTTTGT TTACGGTGCT 
1141 GGGGGTGGGA TGGAATGTGT GGTCACAGCG TTCTCTCTTC TACAAGGCAA GGAGTTCGAG 
1201 AACCAAGTTC TCAACAAACG TTCAGTAATG CCTCCGAAAT ACGTCTTTGG TTTCTTCCAG 
1261 GGTGTTTTCG GGACTTCTTC CTTGTTGAGA GCGCATATGC CAGCAGGTGA GAACAACATC 
1321 TCAGTCGAAG AAAHGTAGA AGGTTATCAA AACAACAAH TCCCTTTCGA GGGGCTCGCT 
1381 GTGGACGTGG ATATGCAAGA CAACITGCGG GTGTTCACCA CGAAGGGCGA ATTTTGGACC 
1441 GCAAACAGGG TGGGTACTGG CGGGGATCCA AACAACCGAT CGGTTTnGA ATGGGCACAT 
1501 GACAAA.6GCC nGTTTGTCA GACAAATATA ACHGCTTCC TGAGGAATGA TAACGAGGGG 
1561 CAAGACTACG AGGTCAATCA GACGHAAGG GAGAGGCAGT T6TACACGAA GAACGACTCC 
1621 CTGACGGGTA CGGATTTTGG AATGACCGAC GACGGCCCCA GCGATGCGTA CATCGGTCAT 
1681 CTGGACTAT6 GGGGTGGAGT AGAATGTGAT GCACTTFTCC CA6ACTGGGG ACGGCCTGAC 
1741 GTGGCCGAAT GGTGGGGAAA TAACTATAAG AAACTGTTCA GCAHGCTC CGACTTCGTC 
1801 T6GCAAGACA TGACTGTTCC AGCAATGATG CCGCACAAAA TTGGCGATGA CATCAA^GTG 
1861 AAACCG6ATG GGAATTGGCC GAATGCGGAC 6ATCCGTCCA ATGGACAATA CAACT3GAAG 
1921 ACGTACCATC CCCAAGTGCT TGT.AACTGAT ATGCGHATG A6AATCATGG TCGGG^ACCG 
1981 ATGGTCACTC AACGCAACAT TCATGCGTAT ACACTGTGCG AGTCTACT^G GAA.GGAAGGG 
2041 ATCGTGGAAA ACGCAGACAC TCTAACGAAG TTCCGCCGTA GCTACATT^T CAGTCGTGGT 
2101 GGTTACATTG GTAACCAGCA "nTCGGGGGT ATGTGGGTGG GAGACAACTC TACTACA'CA 
2161 AACTACATCC AAATGATGAT TGCCAACAAT ATTA.ACATGA ATATGJCTTG CTTGCC'r: 
2221 GTCGGCTCCG ACATTGGAGG ■^i iCACCTCA TACGACAATG AGAATCAGCG A-CGCCGiG^ 
2^81 ACCuGGGACT iGATGGTGAG GTATGTGCAG GCGGGCTGuC iGiiGC^GiG u • .AoGAAc 
2341 CACTATGATA GGTGGATCGA GTCCAAGGAC CACGGAAAGG ACTACCAGGA GC'GTACATG 
2401 I -TCCGAATG AAATGGATAC GTTGAGGAAG TTCG:iGAAT TCCGTTA;CG GGCAGGAA 
2461 GTGTTGTACA CGGCCA^GT^ CCAGAATGCG GCTTTCGGAA AGCCGATT-" C-AGGC'GCT 

2§2i tcgatgtaca ataacgact: aaacghcgc agggcgcaga acgatca~ ::~:"tggt 

2581 GuACATGATG GAiAT\^3CAi ■ v. i vjCGCu Cv,Tci loTui uGGAGAA. jAC-^AA'-vJ^ 
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2641 GAATTGTACT TGCCCGTGCT 6ACCCAATGG TACAAATTCG G1CCCGAC7T TGACACCAAG 
2701 CCTCTGGAAG GAGCGATGAA CGGAGGGGAC CGAATHACA ACTACCCTGT ACCGCAAAGT 
?761 GAATCACCAA TCTTCGTGAG AGAAG6TGCG AHCTCCCTA CCC3CTACAC GT-QAACGGT 
2821 GAAAACAAAT CATTGAACAC GTACACGGAC GAAGATCCGT TGGTGTTTGA AGTATTCCCC 
2881 CTCGGAAACA ACCGTGCCGA CGGTATGTGT 7ATCTTGATG ATGGCGGTGT GACCACCAAT 
2941 GCTGAAGACA ATGGCAAGT7 C^CTGTCGTC AAGGTGGCAG CGGAGCAGGA TGGTGGTACG 
3001 GAGACGATAA CGTTTACGAA TGATTGCTAT GAGTACGTr TCGGTGGACC GTTCTACGT 
3061 CGA6TGCGCG GGGCTCAGTC GCCGTCGAAC ATCCACGTGT CTTCTGGAGC GGGTTCTCAG 
3121 GACATGAAGG TGAGCTCTGC CACTTCCAGG GCTGCGCTGT TCAATGACGG GGAGAACGGT 
3181 GATTTCTGOG TTGACCAGGA GACAGATTCT CTGTGGCTGA AGHGCCCAA CGnGT~:TC 
3241 CCGGACGCTG TGATCACAAT TACCTAA 
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a-GLUCAN LYASE CODING SEQUENCE 

SEQUENCE TYPE: NUCLEIC ACID 

MOLECULE TYPE. DNA (GENOMIC) 

ORIGINAL SOURCE: ^UNGALLY INFECTED ALGAE 

SEQUENCE LENGTH: 3276 6? 

STRANDEDNESS: DOUBLE 

SEQUENCE: 

10 • 20 



3C 



40 



- "1 



0^ 



1 

61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2231 
2341 
2401 
2461 
2521 
2581 



ATGTATCCAA 
GTGGGCATTT 
GTGCGCAAAA 
GHACTGGAG 
GTGTGGCGCT 
GGCGATATTG 
TCCnCACTC 
AGGGHCGCT 
CAGCAACTAA 
TTCACAAAAG 
AATTTTAA6A 
GTGCC6GCAG 
GGCACTGCCA 
TTCTATGGTG 
TACATCTTAG 
AACCAGTCAG 
TATTTTGCAG 
TCGTACG6AT 
TGGAACTGTG 
CATTTTGTGT 
CAAGGCAAAG 
GTGnTGGTT 
6AGGGTG6TA 
CCTTTAGAGG 
AA6ATTGAAT 
GT6TTTGAAT 
AGAAACGACA 
TACACGAA6A 
GATGCGTACA 
GACTGGGGTC 
ATTGGTCTGG 
GGCGACGCAG 
GGACGATACA 
AATCATGGAA 
TCTACGAGGA 
TATAHATCA 
GACAACTCTT 
ATG^CTTGCC 
"GTGTC 
TTCAGAAACC 
CTGTACATGT 
1 GGCAGGAGG 
AAGGCAGCTT 



CCCTCACCTl 
TTAGGTCACA 
6CAACCGCCT 
GGAAGGACAA 
TCGACCCCCT 
ACGACTGGA.C 
TCGAGAAACC 
TCAACCCTGA 
ATTGGATCCG 
AAGGTTTCTT 
CTAGAGTTAC 
GATCGHAGG 
TCGCTTCCGT 
CAGGAGAAGT 
AACGAACTG6 
ATCTTAHGC 
CACCTTGGGT 
GGTTTATGGA 
6AGAGGAGAA 
ATG6TGATGG 
AGTHGAGAA 
ACTTTCAGGG 
ATAACATCTC 
GGTTAGCCGT 
"GGACGGC 



GGGCACATGA 
ACGGCGGGGC 

ATGACTCACT 
TTG6ACATCT 
GACCGGGTGT 
ATTTCGTCTG 
TCGATACGAG 
ATT6GAAATC 
GGGAACCGAT 
AGGAA6GGAT 
GTCGTGGAGG 
CCTCCCAAAG 
TTCCACTAGT 
CCGGGGATCT 
ACTATGGTAG 
ACAAGGACGA 
TGHGTACAC 
CCATu I ACGA 
GACACGATGG 



CATTCTTATT 

CAATGTATCC 

CCCGGACAAT 

CAGCAA.TACG 

GGCGACAATG 

GGTTCAGGIT 

TGGTCCTATT 

GAAGCAGGAG 

GAAATTTGAG 

GAGGAAGAGG 

6AACAAGTGC 

TAATGAAAAT 

AAACTGCGAG 

AATCGCCATG 

TCCAGGATAT 

AGTTGTTAAG 

TAAT6TCTCC 

CTTGGCATAC 

AGATGGTCTT 

CCAAGTTCTG 

AGTCTTTGGG 

TGTTCAAGAG 

AGATGTGGAT 

AAATAAGGTA 

CAAAGGCCTT 

AGATTACGAA 

GACGAACACT 

GGACTATGGT 

GGCTGAATGG 

GCAAGACATG 

ATCACCHAC 

HACCATCCA 

GTTCACTCAA 

TGTTGCAAAT 

TTA.CAi iGGC 

ATACCTCCAA 

TGGGTCCGAC 

AATGGTAAGA 

GTTGGTCGAG 

GA I GuC I -CA 

TGCTA I G i AC 

C.AAC3ACAGA 

iTATCG'Ar 



A.GTGCGCTAG 
CATTCGGTTG 
ATGTCCGCTT 
ATCAATTACA 
AACTGGTTTG 
AATGTGAACT 
CAGGTCACGT 
CGCGAT6TGG 
CAGTCGAAGG 
ACCAAGGATC 
GATG6AAAAG 
CGGGGATTGA 
TACCGCAACG 
TTTTGGGACT 
ACAAATTACA 
CCTTCCGACC 
GGATGCAGTG 
CAAACTTACA 
ATGGGAGCAC 
GAGGATGTTG 
AACAAACGTG 
ATTGCTTCCT 
AHGTCGAAG 
ATGCAACAAG 
GGCACCGGGG 
GTATGTCAGA 
GTCAATCAGA 
AACTTCGGAA 
GGCGGAGGGA 
T6GGGTGATA 
ACAGTTCCAG 
GGCTGGCCGA 
CAAGTTCTCG 
CGCAATATGC 
GCAGACACTC 
AACCAGCATT 
ATGATGATCG 
Ai iGGAGGi I 
1 'v-o I oCAGu 
GGCAAGCAAG 
7TGAGAAAAT 
CAGAATGCGG 
,AACG- iCGCG 
Ti a I G ! GCA(„ 



GGGCCAGAAC 
TTCCAGCGGT 
TGTTCGACAA 
CCACHATGA 



GCGTC 1 AGCT 

ACCGAC' 
CTACGTCCCT 



iTTTG CTGCCGGATC TTCCACT.::( 



TCGACCGTAT 

CATACAAGAA, 

ATCGTGGGCC 

GGTTTGATCC 

TGAACGTTAT 

GGATCAT6GA 

TGTTT6TCGA 

ATCCCGACAG 

CCGAACAAAA 

ATTATGACAA 

CGAACTTCTA 

GCAACAGCGA 

TGAATACTGG 

AGT6CGGTCC 

TCCAAGCGTT 

CCGTAATGCC 

TGTTGAGAGA 

GTTACCAAAG 

ATTTGCGCGT 

GAGACTCGAA 

CGAATGHAC 

CATTGAGGGA 

CTACCAACGA 

AHGTGATGC 

ACTACAGCAA 

CTATGATGCC 

ATGAGAATGA 

TAACTGATAT 

ATGCGTACAC 

TAACGAAGTT 

TTGGAGGAAT 

CGAACATCGT 

TTAC ; i CGT-i 

CvjuG 1 1 UL. ' ' 

AGGGAAAATA 

TCATTGAAT~ 

C I i i ClaUw^.-^ 

gcgcacagga 

C I GTTG I G 1 3 



CGACAATCCA 
CAATTGTTTC 
TATCCTCCA6 
TAAGATGGGC 
CATATATGGC 
GAATAATGAA 
CAGGHGTAC 
GAAAGAGGGG 
CAGGAACAAG 
CTATAACTAC 
CATTCCCATG 
TGAACAGTAC 
TGGTACnCC 
ATTTGACCAA 
CTCTCTTCT6 
TCCGAAATAT 
GCAAAGACCA 
CAATAACTTC 
GTTCACCACG 
TAACAAGTCG 
TTGCTTCTT6 
6AAGGGTTTG 
CGGGCCGAGC 

AcmrcccA 

GCTCTTCAAA 
ACAC 

TCCTTCGAAC 

ACTCTGTGAA 

u I GGG ' ouA 
CAACA73A/\C 
TGATGGACGA 

CTATCAAGAA 



' - w w J'^ 



wo 96^9415 



FIGURE 8 CONTINUED 

2641 ACCAGTCGCG ATCTGTACn GCCTGTGCT6 

2701 6ACACCAAGC GCCTGGATTC TGCGHGGAT 

2761 CCACAAAGCG ACTCTCCGAT ATTTGTGAGG 

2821 TTGGACGGTT CGAACAAGTC AATGAACACG 

2881 GTA1TCCCTC TTGGAAACAA CC3T6CCGAC 

2941 ACTACAGATG CTGAGGACCA IGGCAAAnC 

3001 GGTGTTACGA CGACGATCAA. GTTTGCGTAT 

3061 TTCTACGTTC GAATCCGTAA TCTACGACT 

3121 GG'GAAGAGG ACATGACACC GACCTCTGCG 

3181 GGTGrrGGAG AATACTGGGC TGACAATGAT 

3241 CTGGTTCTGC AAGACGCTGT GATTACCAn 



PCT/EP96/01008 

W3L 



ACCAAATGGT ACAAA.TTCGG CCCTGACTAT 
GGAGGGCAGA TGAHAAGAA CTATTCTGTG 
GAAGGAGCTA T7CTCCCTAC CCGCTACACG 
TACACAGACA AAGACCCGTT GGTGTTTGAG 
GGTATGTGTT ATCTTGATGA TGGCGGTATT 
TCTGHATCA ATGTCGAA6C CTTACGGAAA 
GACACr'ATC AATACGTA" T3ATGGTCCA 
GCATCAAAAA HAACGIG": TTCTGGAGCG 
AACTCGAG6G CAGCTTTGT CAGTGATGGA 
ACGTC7TCTC TGTGGATGAA GTTGCCAAAC 
ACGTAG 



wo 96/29415 PCT/EP9MH008 

FIGURE 9 

a-GLUCAN LYASE CODING SEQUENCE 
SEQUENCE TYPE: NUCLEIC ACID 
MOLECULE TYPE; ONA (GENOMIC^; 
ORIGINAL SOURCE: FUNGUS 
SEQUENCE LENGTH: 3201 BP 
STRANDEDNES5 : DOUBLE 
SEQUENCE: 

10 20 30 iO 50 50 

ATGGCAGGAT TTTCTGATCC TCTCAAC~T TGCAAAGCAG AA.GACTACTA CAGTGHGCG 
70 80 90 100 110 120 

CTAGACT6GA AGGGCCCTCA AAAAATCATT GGAGTAGACA CTACTCCTCC AAAGAGCACC 

130 140 150 160 170 180 

AA6TTCCCCA AAAACTGGCA TGGAGTGAAC TTGAGATTCG ATGATGGGAC "^AGGTGTG 

190 200 210 220 230 240 

GlTCAGnCA TTAGGCCGTG CGTTTGGAGG GTTAGATACG ACCCTGGTTT CAAGACCTCT 

250 260 270 280 290 300 

6ACGAGTATG GTGATGAGAA TACGAGGACA ATTGTGCAAG ATTATAT6AG TACTCTGAGT 

310 320 330 340 350 360 

AATAAATTGG ATACTTATAG AGGTCTTACG TGGGAAACCA AGTGTGAGGA TTCGGGAGAT 

370 380 390 400 410 420 

TTCTTTACCT TCTCATCCAA GGTCACCGCC GTTGAAAAAT CCGAGCGGAC CCGCAACAAG 

430 440 450 460 470 480 

6TCGGCGATG GCCTCAGAAT TCACCTATGG AAAAGCCCTT TCCGCATCCA A6TA6TGCGC 

490 500 510 520 530 540 

ACCTTGACCC CTTTGAAGGA TCCTTACCCC ATTCCAAATG TAGCCGCAGC CGAAGCCCGT 

550 560 570 580 590 600 

GTGTCCGACA A6GTCGTTTG 6CAAACGTCT CCCAAGACAT TCA6AAAGAA CCTGCATCCG 

610 620 630 640 650 660 

CAACACAAGA TGCTAAAGGA TACAGTTCn GACATTGTCA AACCTGGACA TGGCGAGTAT 

670 680 690 700 710 720 

GTGGGGT666 GAGA6ATGGG AGGTATCCAG TTTATGAAGG AGCCAACATT CATGAACTAT 

730 740 750 760 770 780 

TTTAACTTCG ACAATATGCA ATACCAGCAA GTCTATGCCC AA.GGTGCTCT CGAHCTCGC 

790 300 810 820 830 840 

GAGCCACTGT ACCACTCGGA TCCCTTCTAT CTT6AT6TGA ACTCCAACCC GGAGCACAAG 

850 860 870 880 890 900 

AATATCACGG CAACCTTTAT CGATAACTAC TCTCAAATTG CCATCGACTT TGGAAAGACC 

910 920 930 940 950 960 

AACTCAGGCT ACATCAA.GCT GGGAACCAGG TATGGTGGTA TCGATTGTTA CGGTATCAGT 

970 930 990 1000 1010 1020 

GCGGATACGG TCCCGGAAAT TGTACGACTT TATACAGGTC TTGnOGACG TTCAAAGHG 

1030 1040 1050 1060 1070 1080 
AAGCCCAGAT ATATTC'CGG GGCCCATCAA GCCTGTTiTG GATACCAACA G5AAAGTGAC 
_ _1090 ^_liOO ^ 1110 ^1120 1130 I14C 

„ ..^^5 it" — ^'4-5 

\jAhj:uuA: j CAGGACGG C. I CAG.AAC ^ACCA^^CA ACC-wACACAl .^CC.AAw 
1210 1220 1230 1240 1250 1260 

^ 12/0 1280 1290 ^1300 ___1313 

CuTG ! I A 1 >^A uCA I \ AAcAA v^AGAGAGoG • ooA . ACAG \ A ^ ^uTC w - \ uA "joGAG . • GA^ 
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FIGURE 9 CONTINUED 

1330 1340 1350 1360 1370 1380 

AAAAAATACT TTATCATGGA C3ACAGATAT ACCGAGGGAA, CAAGTGGGAA TGCGAy^GGAT 

1390 1400 1410 1429 1430 I44C 

GTTCGGTACA TGTACTACGG TGGTGGTAAT AAGGTTGAGG "CGATCCTAA 7GATG~AAT 

1450 1460 1470 1480 1490 1500 

GGTCGGCCAG ACriTAAAGA C.AAC7ATGAC rrCCCCGCGA ACTTCAACAG CAAACAATiC 

1510 ^1520 ^ 1530 1540 1550 _ _ 1560 

CCC i ATCA I G GTGGTGTGAG C^ACuG i i A i GGGAACGG i A G i oCAGGT i : it ACx^C^GAl 

1570 1580 1590 1600 1610 1520 

CTCAACAGAA AGGAGGTTCG TATCTGGTGG GGAATGCAGT ACAAGTATCT CTTCGATATG 

1630 1640 1650 1660 1670 1680 

GGACT6GAAT HGIGIGGCA AGACATGAC"^ ACCCCA6C,V\ TCCACACATC ATATGGAGAC 

1690 1700 1710 1720 1730 1740 

ATGAAA6GGT TGCCCACCCG TCTAC^CGTC ACCTCAgACT CCGTCACCAA TGCCTCTGAG 

1750 1760 1770 1780 1790 1800 

AAAAAGCTCG CAATTGAAAC TTGGGC'CTC TACTCCTACA ATCTCCACAA A6CAACT7GG 

1810 1820 1830 1840 1850 1860 

CATGGTCTTA GTCGTCTCGA ATCTCGTAAG AACAAACGAA ACTTCATCCT CGG6CGTGGA 

1870 1880 1890 190O 1910 1920 

A6TTATGCCG GAGCCTATCG TTTTGCT6GT CTCTGGACTG GGGATAATGC AAGTAACTGG 

1930 1940 1950 1960 1970 1980 

GAAnCTGGA AGATATCGGT CTCTCAAGH CTTTCTCT6G GCCTCAATGG TGTGTGCATC 

1990 2000 2010 2020 2030 2040 

GCGGGGTCTG ATACGGGTGG TTTTGAACCC TACCGTGATG CAAATGGGGT CGAGGAGAAA 

2050 2060 2070 2080 2090 2100 

TACTGTAGCC CAGAGCTACT CATCA6GT6G TATACT6GTT CATTCCTCTT GCC6TGGCTC 

2110 2120 2130 2140 2150 2160 

AGGAACCATT A7GTCAAAAA G6ACAGGAAA TGGHCCAGG AACCATACTC GTACCCCAAG 

2170 2180 2190 2200 2210 2220 

CATCTTGAAA CCCATCCAGA ACTCGCAGAC C.AAGCATGGC TCTATAAATC CGTTTrGGAG 

2230 2240 2250 2260 2270 2280 

ATCTGTA66T ACTATGTGGA GCTTAGATAC TCCCTCATCC AACTACTTTA CGACTGCATG 

2290 2300 2310 2320 2330 2340 

TTTCAAAACG TaGiCGACGG TATGCCAATC ACCAGATCTA TGCTCHGAC CGATACTGAG 

2350 2360 2370 2380 2390 2400 

GATACCACCT TCTTCAACGA GA6CCAAAAG TTCCTCGACA ACCAATATAT GGCTGGTGAC 

2410 2420 2430 2440 2450 2460 

GACATTCTTG TTGCACCCAT CCTCCACAGT CGCAAAGAAA TTCCAGGCGA AAACAGAGAT 

2470 2480 2490 2500 2510 2520 

GTCTATCTCC CTCTTTACCA CACCTGGTAC CCCTCAAATT T3AGACCATG GGACGATCAA 

2530 2540 2550 2560 2570 2580 

GGAGTCGCTT TGGGGAATCC TGTCGAAGGT GGTAGTGTCA TCAAHATAC 7GCTAGGATT 

2590 2600 2610 2620 2630 2640 

GT7GCACCCG AGGATTATAA TCTC7TCCAC A,GCG7GGTAC CAG7CTACG7 7AGAGAGGG7 

2650 2660 2670 2680 2690 2700 

3CCA7CAiCC CGCAAAiCGA AG7ACGCCAA 7GGAC7GGCC AGGGGGGAGC CAACCGCATC 

_ 2710 ^2720 ^ Z:'30 ^ _ 2740 £50 2760 

AAG I • CAACA I CTACCCTGG AAAGGATAAG GAGTACTG • A i A i ■ i GA TGATGj iu > ' 
. .J"'*^' .^2780 _2790 ^ ^ ^2300 ^ _„2810 _J320 
AGCCG7GATA G7GCGCCGGA AGACC7CCCA CAG7ACAAAG AGACCCACGA ACAGCGAAG 

^ ^2830 2340 ^ ^ 2360 2860 ^^^^_2870 ^ _^288G 

G 1 HjAAoGCG CGGAAA7CGC AAAGCAGAii GGAAAGAAGA v.3uGiiACAA CATC CAGGA 

2890 2900 2910 292C 2930 2940 
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FIGURE 9 CONTINUED 

ACC6ACCCAG AAGCAAAGGG TTATCACCGC AAAGHGCTG TCACACAAAC GTCAAAAGAC 

2950 2960 2970 2980 2990 3000 

AAGACGCGTA CTGTCACTAT TGAGCCAAAA CACAATGGAT ACGACCCTTC CAAAGA3GTG 

3010 3020 3030 3040 3050 3060 

GGTGATTATT ATACCATCAT TCTTTGGTAC GCACCAGGT TCGATGGCAG CATCGTCGAT 

3070 3080 3090 3100 3110 3120 

GTGAGCAAGA CGACTGTGAA. 'GTTGAGGGT GGGGTGGAGC ACCAAGTTTA "AAGAACTCC 

3130 3140 3150 3160 3170 3180 

GATTTACATA CGGTTGTTAT CGACGTGAAG GAGGTGATCG GTACCACAAA GA6CGTCAAG 

3190 3200 
ATCACATGTA CTGCCGCTTA A 



wo 96/29415 PCT/i:P96/010a8 

FIGURE 10 

a-GLUCAN LYASE CODING SEQUENCE 
SEQUENCE TvpE NUCLEIC ACID 
MOLECULE T'^PE: ONA (GEMOMIC^, 
ORIGINAL SOURCE; -UNGuS 
SEQUENCE LENGTH: 3213 3P 
STRANDEDNE33 . DOUBLE 
SEQUENCE: 

^10 20 _J0 aO SO 60 

ATGGCAGGAi lATCCGACCC "CTCAAi; ;C ~3CAAAGCAG AGGACTACiA CGCTGCTGCC 
70 80 9G 100 110 120 

AAAGGCTGGA GTGGCCC'CA GAAGATCATT CGC'ATGACC AGACCCCTCC iCAGGGTACA 

130 140 159 160 170 180 

AAAGATCCGA AAAGCTGGCA TGCGGTAAAC :~CCTTC3 ATGACGGGAC TATGTGTG7A 

190 200 210 220 230 240 

6TGCAATTCG TCAGACCCT6 TGHIGGAGG GTTAGATATG ACCCCAGTGT CAAGACTTCT 

250 260 270 280 290 300 

GAT6AGTACG GCGATGAGAA TACGAGGACi ATTGTACAAG ACTACATGAC TACTCTGGTT 

310 320 330 340 350 360 

GGAAACTTGG ACATTTTCAG AGGTCHACG TGGGTTTCTA CGTTGGAGGA HCGGGCGAG 

370 380 390 400 410 420 

TACTACACCT TCAAGTCCGA AGTCACTGCC GTGGACGAAA CCGAAC66AC TCGAAACAAG 

430 440 450 460 470 480 

6TCGGCGACG GCCTCAAGAT TTACCTAT6G AAAAATCCCT TTCGCATCCA GGTAGTGCGT 

490 500 510 520 530 540 

CTCTTGACCC CCCTGGTG6A CCCTTTCCCC AHCCCAACG TAGCCAATGC CACAGCCCGT 

550 560 570 580 690 600 

GTGGCCGACA AGGTTGTTTG GCAGACGTCC CCGAAGACGT TCAGGAAAAA CTT6CATCCG 

610 620 630 640 650 660 

CAGCATAAGA TGTTGAA6GA TACAGTTCTT GATATTATCA AGCCGGGGCA CGGAGAGTAT 

670 680 690 700 710 720 

GTGG6TTGGG GAGAGATGGG AGGCATCGA6 TTTATGAAGG AGCCAACATT CATGAAHAT 

730 740 750 760 770 780 

nCAACTTTG ACAATATGCA ATATCAGCAG GTCTATGCAC AAGGCGCTCi TGATAGTCGT 

"90 800 810 820 830 840 

GAGCC3TTGT ATCACTCTGA TCCCTTCTAT CTCGACGTGA ACTCCAACCC A6AGCACAAG 

850 860 870 880 890 900 

AACATTACGG CAACCTTTAT CGATAACTAC TCTCAGATTG CCATCGACTT TGGGAAGACC 

910 920 930 940 950 960 

AACTCAGGC^ ACATCAAGCT GG3TACCAGG 7ATGGCGGTA TCGATTGTTa CGGTA^CAGC 

970 980 990 ^ ^1000 ^ 1010 _ ^ 102C 

GCGGATACGG TCCCGGAGAT TGTGCGACi' ^ATACTGGAC <iGiiGGGCG i iCGAAGTTG 

_^103C 1040 105C 1060 ^1070 ^^1080 

AAGC'^CAG;^ ; -<A: .u"^ljG AGCCCACCAA GC iGi ! '^TG gataccag^a ggaaagtgac 
109C HOC 1110 1120 1130 114C 

1150 1160 IIJC 1180 ^ U90 ^120C 

uAio ik*oA-^ . . ^•^ooAC'-A ■ • ' -^AGAACo • -*C^Au ' ^ -*u^woAt i a . ^ -^^^ \ AA ; 
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FIGURE 10 CONTINUED 
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CCTGTTATCA 
1330 

AAAAAGTAC" 
1390 

GTTCGATACT 
1450 

1510 
CCCTATCATu 

Id/ 'J 
CTTAACAGAG 

1630 
GGACTAGAGT 

1690 
ATGAAAGGGT 

1750 
AAAAAGCTCG 

1810 
CACGGTCHG 

1870 
AGTTACGCCG 

1930 
GAATTCTGGA 

1990 
GCGG6GTCTG 

2050 
TGCAGTCCGG 

2110 
AACCACTACG 

2170 
CTTGAAACCC 

2230 
TGCAGATACT 

2290 
CAAAAC6TGG 

2350 
ACGACCTTCT 

2410 
ATCCTT6TAG 

2470 
TATCTCCCTC 

2530 
GTCGCTTTAG 



.... 2590 
GCCCCA6AGG 
2650 
ATCATTCCGC 
2710 
TTCAATATCT 
2770 
CGCGATAGT-j 
2830 



GTATCAGAGA 

1340 
TCATCATGGA 

1400 

CrnTTACGG 

1460 
ACrTTGGAGA 
^^^^1520 
GTGGTGTGA6 

1580 
AGGAGGTTCG 
_ 1640 
I iGTATGGCA 

1700 
TGCCCACCCG 

1760 
CAATTGAAAG 

1820 
GTCGTCHGA 

1880 
GTGCCTATCG 

1940 
AGATTTCGGT 

2000 
ATACGG6TGG 

2060 
AGCTACTCAT 

2120 
TCAAGAAGGA 

2180 
ATCCAGA6CT 

2240 
GGGTAGAGCT 

2300 
TCGATGGTAT 

2360 
TCAATGAGAG 

2420 
CACCCATCCT 

2480 
TATTCCACAC 

2540 
GGAATCCTGT 

2600 
AHATAATCT 

2660 
AAATTCAGGT 

2720 
ACCCTGGAAA 

2780 
CACCA6ATGA 

2840 



7CGCCCGAA.T 

1350 
TGACAGATAT 

1410 
CG6TG3GAAC 

1470 
C.AAT7ATGAC 
_ ^^1530 
i i ACG8ATA » 

1590 
TATCTGGTGG 

1650 
A6ATATGACA 

1710 
TCTGCTCGTC 

1770 
TT666CTCTT 

1830 
GTCTCGTAAG 

1890 
TTTTGCTGGT 

1950 
CTCCCAAGTT 

2010 
TTTTGAGCCC 

2070 
CAGGTG6TAT 

2130 
CAGGAAAT6G 

2190 
CGCAGATCAA 

2250 
AAGATAHCC 

2310 
6CCACTTGCC 

2370 
CCAAAAGTTC 

2430 
CCACAGCCGT 

2490 
CTGGTACCCC 

2550 
CGAA6GTGGC 

2610 
CTTCCACAAC 
^_^2670 
ACGv^lAo I uu 

2730 
GGACAAGGAG 

2790 

2850 



GGGTACAGTA 
1360 

1420 
CC3GTTGAGG 
MSG 
7TCCCTACGA 
1540 
G3GAATGGCA 
1600 
GGATTGCAGT 
1560 
ACCCCAGCGA 
1720 
ACCGCCGACT 
1780 
TACTCCTACA 
1840 
AACAAACGTA 
1900 
CTCTGGACTG 
1960 
CTTTCTCTAG 
2020 
GCACGTACTG 
2080 
ACTGGATCAT 
2140 
TTCCAGGAAC 
2200 
GCATGGCTTT 
2260 
CTCATCCAGC 
2320 
AGATCTATGC 
2380 
CTCGATAACC 
2440 
AACGAGGTTC 
2500 
TCAAACTTGA 
2560 
AGCGTATCA 
__^J620 
0 1 oGTuCCGG 



2580 

Ai iGGCGAAG 

2740 
TATGTGACGT 

2800 

tac:gcgagg 

2860 



ccctcaatga 

1370 

CAA6' 

1430 
TTAACCCTAA 

1490 

AC 1 1 CAACTG 

1550 
CTCCAGG 1 1 A 

1610 

acgagtatct 

1670 
TCCAHCATC 

1730 
CAGTTACCAA 

1790 
ACCTCCATAA 

1850 
ACTTCATCCT 

1910 
GAGATAACGC 

1970 
GTCTCAATGG 

2030 
AGATTG6G6A 

2090 
TCCTTTTGCC 

2150 
CATACGCGTA 

2210 
ACAAATCT6T 

2270 
TCCTTTACGA 

2330 
TCHGACCGA 

2390 
AATATATGGC 

2450 
CGGGAGAGAA 
. 2510 
GACCGTGGGA 

2570 

actacactgc 

2630 
^CTACATCAG 
2590 



GuijATA I oA I 
138G 

I : \j ; i cl: 

■500 
CAAAGACT^C 

laoG 
C 1 ACCv, ■■ GAC 

^ 1520 

C I I CAji.TATG 

1680 
ATATGGA6AC 

1740 
TGCCTCTGAG 

1800 
AGCAACCTTC 

I860 
CGGACGTGGT 

1920 
AAGTACGTGG 

1980 
TGTGTGTATA 

2040 
GGAGAAATAT 

2100 
ATGGCnAGA 

2160 
CCCCAAGCAT 

2220 
TCTAGAAAH 

2280 
CTGCATGTTC 

2340 
TACTGAGGAT 

240C 
TGGTGACGAC 

2460 
CAGAGATGTC 

252G 
CGATCAGGGA. 

caggattgtt 

254C 
AGAGGG I GCC 
2"CG 



_^_2750 

acc ■ I gatga 

2310 
2S73 



' AGC 



23S- 
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FIGURE 10 CONTINUED 

GAAGGCAAAG ACGTCCAGAA GCAACTTGCG GTCATTCAAG GGAATAAGAC TAATGACTTC 

2890 2900 2910 2920 2930 2940 

TCCGCCTCCG GGAHGATAA GGAGGCAAAG GGTTATCACC GCAAAGTTTC TATCAAACAG 

2950 2960 2970 2980 2990 3000 

GAGTCAAAAG ACAAGACCCG TACTGTCACC ATTGAGCCAA AACACAACGG ATACGACCCC 

3010 3020 3030 3040 3050 3060 

TCTAAGGAAG T7GGTAATTA 7TATACCATC ATTCTTTGGT ACGCACCGGG CTTGACGGC 

3070 3080 3090 3100 3110 3120 
AGCATCGTCG ATGTGAGCCA GGCGACCGTG AACATCGAGG GCGGGGTGGA ATGCGAAATT 

3130 3140 3150 3160 3170 3180 
TTCAAGAACA CCGGCTT6CA TACGGTTGTA GTCAACGTGA AAGAGGTGAT CGGTACCACA 

3190 3200 3210 

AAGTCCGTCA AGATCACTTG CACTACCGCT TAG 
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